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We consider Hermitian and symmetric random band matrices = (h xy ) in d 1 dimensions. The 
matrix entries h xy , indexed by x, y £ (Z/LZ) d , are independent, centred random variables with variances 
s xy = ^Z\h xy \ 2 . We assume that s xy is negligible if \x — y\ exceeds the band width W. In one dimension 
J^j we prove that the eigenvectors of H are delocalized if W 3> L 4,/5 . We also show that the magnitude 

of the matrix entries |Ga,y| 2 of the resolvent G = G(z) — (H — z)^ 1 is self- averaging and we compute 
E|G X j,| 2 . We show that, as L — > oo and W > L 4/5 , the behaviour of E|G xy | is governed by a diffusion 
operator whose diffusion constant we compute. Similar results are obtained in higher dimensions. 
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1. Introduction 



Random band matrices H = {h xy ) XtVe Y represent quantum systems on a large finite graph T with random 
quantum transition amplitudes effective up to distances of order W. The matrix entries are independent, 
centred random variables. The variance s xy := E|/i X j,| 2 depends on the distance between the two sites x and 
y, and it typically decays with the distance on a characteristic length scale W, called the band width of H. 
This terminology comes from the simplest one-dimensional model where the graph T = {1,2,..., N} is a 
path on N vertices, and the matrix entries h xy are negligible if \x — y\ > W. In particular, if W — N and 
all variances are equal, we recover the well-known Wigner matrix, which corresponds to a mean-field model. 
Higher-dimensional models are obtained if V taken to be the box of linear size L in Z d . In this case the 
dimension of the matrix is N = L d . 

Typically, W is a mesoscopic scale, larger than the lattice spacing but smaller than the diameter L of 
the system: 1 <C W <C L. These models are natural interpolations between random Schrodinger operators 
with short range quantum transitions such as the Anderson model [3j and mean-field random matrices 



such as Wigner matrices 39 . In particular, random band matrices may be used to model the Anderson 
metal-insulator phase transition, which we briefly outline. 

The key physical parameter of all these models is the localization length £, which describes the typical 
length scale of the eigenvectors of H. The system is said to be delocalized if the localization length is 
comparable with the system size, I ~ L, and it is localized otherwise. Delocalized systems arc electric 
conductors, while localized systems are insulators. 

Nonrigorous supersymmetric calculations 27 show that for random band matrices the localization length 
is of order £ ~ W 2 in d = 1 dimension. In d = 2, the localization length is expected to be exponentially 
growing in W, and in d 3, it is macroscopic, I ~ L, i.e. the system is delocalized. We refer to the overview 
papers of Spencer 35 37 and to the paper of Schenker 32 for more details on these conjectures. 

These predictions are in accordance with those for the Anderson model, where the random matrix is 
of the form —A + XV; here A is the lattice Laplacian, V a random potential (i.e. a diagonal matrix with 
i.i.d. entries), and A a small coupling constant. The localization length is £ ~ A~ 2 in the regime of strong 
localization, which corresponds to the whole spectrum for d = 1 and a neighbourhood of the spectral edges 
for d > 1. This result follows from the rigorous multiscale analysis of Frohlich and Spencer [26] as well as 
from the fractional moment method of Aizenman and Molchanov [2] . The two-dimensional Anderson model 
is conjectured to be in the weak localization regime with £ ~ exp(A -2 ) throughout the spectrum [lj, but this 
has so far not been proved. In dimensions d 3, the prediction is that there is a threshold energy, called the 
mobility edge, E , that separates the localized regime near the band edges from the delocalized regime in the 
bulk spectrum. The localization length is expected to diverge as the energy E approaches the mobility edge 
from the localization side. The increase of the localization length as an inverse power of E — E has been 
rigorously established up to a certain scale in [5 ,36, but this analysis does not allow E to actually reach 
the conjectured value of Eq. A key open question for the Anderson model is to establish the metal- insulator 
transition, i.e. to show that the mobility edge indeed exists. 

For random band matrices, the metal-insulator transition can be investigated even in d — I by varying 
the band width W. The prediction that the localization length £ is of order W 2 can be recast in the form that 
the eigenvectors are delocalized if W ^ L 1 / 2 . Currently only lower and upper bounds have been established 



for I. On the side of localization, Schenker 32 proved that £ ^ W , uniformly in the system size, by 
extending the methods of the proofs of the Anderson localization for random Schrodinger operators. As a 
lower bound, £ ^ W was proved in [23| by using a self-consistent equation for the diagonal matrix entries 
G xx of the Green function G = G(z) = (H — In particular, Wigner matrices (W — L) are completely 
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delocalized; in fact this has been proven earlier in 18 . 19 21 using a simpler self-consistent equation for 
the trace of the Green function, TrG. This lower bound was improved to I ^ W 7 ^ 6 in [6] 7 by using 
diagrammatic perturbation theory. In fact, not only was the lower bound on localization length established, 
but it was also shown that the unitary time evolution, e ltH , behaves diffusively on the spatial scale W, 
i.e. the typical propagation distance is \ftW . Thus, the mechanism responsible for the derealization of 
random band matrices is a random walk (in fact, a superposition of random walks) with step size of order 
W. Showing that the localization length is greater than the naive size W therefore requires a control of 
the random walk for large times. For technical reasons, in [6j[7] the time evolution could only be controlled 
up to time t ^ W 1 ^ 3 , which corresponds to derealization on the scale W 1+1 ^ 6 . The work IgIIt] was partly 
motivated by a similar results for the long-time evolution for the Anderson model 15-17 combined with an 



algebraic renormalization using Chebyshev polynomials 26 33 



In the current paper we develop a new self-consistent equation (sec (4.2|)) which keeps track of all matrix 



entries of the Green function G, and not only the diagonal ones as in 
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We show that 



\G xy \ 2 is self- 



averaging and E|Ga;j,| 2 behaves as the resolvent of a diffusion operator associated with a superposition of 
random walks with step size of order W. This result can then be translated into a lower bound on the 
localization length. 

More precisely, for d= 1 we obtain full control on \G xy {z)\ 2 for relatively broad bands, W A 4 / 5 , and 
for j] — Imz ^ (W/N) 2 . The condition W ^> A 4 / 5 is technical. The condition on r\ comes from the facts 
that t = rf~ x corresponds to the time scale of the random walk, and a random walk with step size W in a box 
of size N reaches equilibrium in a time of order (A/W) 2 . As a corollary, we prove that most eigenvectors 
arc delocalized if W > A 4 / 5 . This improves the exponent in (g|[t] , where derealization for W ^> A 6 / 7 was 
proved. However, unlike in [6||7], here we do not obtain a lower bound on the localization length t uniformly 
in A. We also prove analogous results in higher dimensions. In addition, we investigate the case where the 
variances s xy of the matrix entries decay slowly according to the power law \x ~ for < j3 < 2. 

In this regime the system exhibits superdiffusive behaviour. In particular, we may allow decay of the form 
\x — y\~ 2 , which is critical in the sense of 29 . 

One key open question for random band matrices is to control the resolvent G(z) for i] = Imz <C W~ x . 
None of the results mentioned above yield a nontrivial control below W~ x . In the regime r\ ^ W^ 1 robust 

For ^ETrG(z) and for r\ > W~° ", 
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pointwise bounds on G xy have been obtained with high probability 

a more precise error estimate was derived for a special class of Bernoulli entries in [34] . However, controlling 
the quantity -^ETrG(z) does not yield information on the localization length. In the current paper we 
obtain much more precise bounds on IGzjJ 2 in the regime r\ (W/N) 2 W^ 1 ! 2 , which in particular imply 
delocalization bounds. 

Supersymmetric (SUSY) methods offer a very attractive approach to studying the derealization transi- 
tion in band matrices, but rigorous control of the ensuing functional integrals away from the saddle points 
is difficult. This task has been performed for the density of states of a special three-dimensional Gaussian 
model [4]; this is the only result where a nontrivial control for 77 ^ W~ Y (in fact, uniform in rj) was obtained. 
The SUSY method has so far only been applied to the expectation of single Green function, EG, and not to 
its square, E|G| 2 . 

The analysis of the trace of the single Green function yields the limiting spectral density of H which is 
the Wigner semicircle law provided the band width W diverges as L — > 00. For band matrices the semicircle 
law on large scales, corresponding to spectral parameter 77 > independent of A, was given in 30 . More 



recently, a semicircle law on small scales, in which rj 1, was derived in 23 and generalized in [9]. The 



results of |9l 23] are summarized in Lemma 3.4 below. As an application of our method, we prove a further 



improvement of the semiricle law in Theorem |2.2| below 



3 



The main new ingredient in this paper is the self-consistent equation for the matrix T, whose entries 



T 



|G, 



are local averages of IG^yl 2 . We show in Theorem 4.1 below that T satisfies a self-consistent equation of the 
form 

T = \m\ 2 ST + \m\ 2 S + £, (1.1) 
where S is the matrix of variances (s xy ), m = m(z) is an explicit function of the spectral parameter z = E+irj 



(see (2.14) below), and £ is an error term. Neglecting the error term £, we obtain 

\m\ 2 S 



T 



1 - \m\ 2 S 



In this paper we implement the band structure of H u sing a symmetric probability density / on m '' 
requiring that s xy rj W~ d f((i — j)/W) (see Section 



2.1 



by 



below for the precise statement). Using translation 
invariancc of S and the Taylor expansion of its Fourier transform S(p) in the low momentum regime, we 
obtain for \p\ < W^ 1 that 

S(p) « l-W 2 (p-Dp) + --- , (1.2) 



of T, we use |m| = 1 — at] + 0(rf) (see (3.5) below), where 



where D is the matrix of second moments of / (see (8.1 1 below). In order to give the leading-order behaviour 



a = a(E) := 



s/A-E 2 



(E = Rez) 



Therefore the Fourier transform of T is approximately given by 



V + W 2 (p-D cS p) 



where 



D ce := 



D 

a 



(1.3) 



(1.4) 



in the regime \p\ <^ W~ x and 77 <§C 1 . This corresponds to the diffusion approximation on scales larger than 
W with an effective diffusion constant -D e ff . In the language of diagrammatic perturbation theory, the change 
from D to D e g has the interpretation of a self-energy renormalization. This result coincides with Equation 
(1.5.5) of [35] , which was obtained by computing the sum of ladder diagrams in a high-moment expansion. 

The main result of this paper is a justification of this heuristic argument in a certain range of parameters. 
The error term £ contains fluctuations of local averages. Roughly speaking, we need to control the size of 
J2 X [\G xy \ 2 — P x \G xy \ 2 ] , where P x denotes partial expectation with respect to the matrix entries in the x-th 
row (see T xy in (4.5 1 below). Unfortunately, (G^yl 2 and IG^/yl 2 for x ^ x' are not independent; in fact they are 



strongly correlated for small 77, and they do not behave like independent random variables. Estimating high 
moments of these averages requires an unwrapping of the hierarchical correlation structure among several 
resolvent matrix entries. The necessary estimates are quite involved. They are a special case of the more 
general Fluctuation Averaging Theorem that is published separately |8j, and was originally developed for 



25 Lemma 



application in the current paper. There have been several previous results in this direction; see 
5.2], [24j Lemma 4.1], [10, Theorem 5.6], and 31 Theorem 3.2]. The Fluctuation Averaging Theorem 



generalizes these ideas to arbitrary monomials of G and exploits an additional cancellation mechanism in 
averages of IG^yj 2 that is not present in averages of G xx . For more details, see [8]. 
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2. Formulation of the results 



2.1. Setup. Fix d £ N and let / be a smooth and symmetric (i.e. f(x) = f(—x)) probability density on R d . 
Let L and W be integers satisfying 

L s ^ W sC L (2.1) 

for some fixed S > 0. The parameter L is the fundamental large quantity of our model. Define the d- 
dimcnsional discrete torus 

T d L := [-L/2,L/2) d C\Z d . 

Thus, has N :— L d lattice points. For the following we fix an (arbitrary) ordering of T^, which allows 
us to identify it with {1, . . . , N}. We define the canonical representative of i € 1 d through 

\i] L := (i + LZ d )DT d L , 

and introduce the periodic distance 

where |-| denotes Euclidean distance in Mr. 

Define the N x N matrix S(L, W) = S = (s^ € T d L ) through 

- 1 f( [ ^), (2-2) 



Z L ,w V W 

where Zl^w is a normalization constant chosen so that S is a stochastic matrix: 

E s « = 1 (2.3) 

3 

for all i € T^. Unless specified otherwise, summations are always over the set T£. By symmetry of / we 
find that S is symmetric: Sjj = Sji. As a stochastic matrix, the spectrum of S lies in [—1,1]. In fact it is 
proved in Lemma A.l of 23 that there exists a positive constant S, depending only on /, such that 

- 1 + 5 ^ S^l. (2.4) 

We let (Qj : i ^ j), where i,j € T^, be a family of independent, complex- valued, centred random 

EC« = 0, E|C«| 2 = 1, C»i€M. (2.5) 



variables dj = Cjj satisfying 



For i > j we define 

Cij ■= Cji- 

We define the band matrix H — (hij) i jgTf through 

% := (^) 1/2 Cii- (2.6) 

Thus we have H = H* and 

E|/iy| 2 = (2.7) 
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In particular, we may consider the two classical symmetry classes of random matrices: real symmetric and 
complex Hermitian. For real symmetric band matrices we assume 



dj € K for all i j . 
For complex Hermitian band matrices we assume 

Ed- = for all i<j. 



(2.8) 



(2.9) 



in addition to (2.5). A common way to satisfy ( |2.9| is to choose the real and imaginary parts of Qj to be 
independent with identical variance. As in [8], our results also hold without this assumption, but we omit 
the details of this generalization to avoid needless complications. 
We introduce the parameter 

M = M N := . (2.10) 

From the definition of S it is easy to see that Z^ : w = W + 0(W d ~ 1 ). In particular, 

M = {W d + 0(W d - 1 ))/\\f\\ oo . 

We assume that the random variables Qj have finite moments, uniformly in N, i, and j, in the sense that 
for all p £ N there is a constant ^ p such that 



(2.11) 



for all N, i, and j. 

The following definition introduces a notion of a high-probability bound that is suited for our purposes. 

Definition 2.1 (Stochastic domination). Let X = (A (Ar) (u) : N e N,u e U^) be a family of random 
variables, where is a possibly N -dependent parameter set. Let ^ = (tyW)(u) '■ N € N,u <E IfW) be a 

deterministic family satisfying ^^^(u) ^ 0. We say that X is stochastically dominated by 'J, uniformly in 
u, if for all e > and D > we have 



sup . 



^ N 



-D 



\X ( - N \u)\> N e ¥ N \u) 

for large enough N ^ Nq(e,D). Unless stated otherwise, throughout this paper the stochastic domination 
will always be uniform in all parameters apart from the parameter 8 in (2.1| and the sequence of constants 
Hp in (2.11); thus, Nq(e,D) also depends on 6 and fi p . If X is stochastically dominated by ^ , uniformly in 
u, we use the equivalent notations 



X -< * 



and 



X 



For example, using Chebyshev's inequality and (2.11) one easily finds that 



(2.12) 



so that we may also write — (^((sj,-) 1 / 2 ). The relation ~< satisfies the familiar algebraic rules of order 
relations. The general statements are formulated later in Lemma |3.3| 



G 



We remark that Definition 2.1 is tailored to the assumption that (2.11) holds for any p. If (2.11) only 



holds for some large but fixed p then all of our results still hold, but in a somewhat weaker sense. Indeed, 
the control of the exceptional events in our theorems is expressed via the relation -<. If only finitely many 



moments are assumed to be finite in (2.11), then the exponents e and D in the definition of -< cannot be 



chosen to be arbitrary, and will in fact depend on p. Repeating our arguments under this weaker assumption 



would require us to follow all of these exponents through the entire proof. Our assumption that (2.11 ) holds 



for any p streamlines our statements and proofs, by avoiding the need to keep track of the precise values of 
these parameters. 

Throughout the following we make use of a spectral parameter 



E + 177 , £■ £ 



7] > . 



We choose and fix two arbitrary (small) global constants 7 > and n > 0. All of our estimates will depend 
on k and 7, and we shall often omit the explicit mention of this dependence. Set 



S = S (jv) (k, 7 ) := [E + ir] : -2 + k sC E ^ 2- k, A/ _1+7 sC 77 10} 



(2.13) 



We shall always assume that the spectral parameter z lies in S(re, 7). In this paper we always consider 
families I^fu) = x[ N \z) indexed by u = (z,i), where z € S(/t, 7) and i takes on values in some finite 
(possibly iV-dependent or empty) index set. 

We introduce the Stieltjes transform of Wigner's semicircle law, defined by 



m(z) := 



1 



2tt J - 2 £ - z 

It is well known that the Stieltjes transform m is characterized by the unique solution of 



m(z) 

with Imm(z) > for Imz > 0. Thus we have 

m(z) ■ 



1 



m{z) 



z = 



+ V~z 



(2.14) 



(2.15) 



(2.16) 



To avoid confusion, we remark that the Stieltjes transform m was denoted by m sc in the papers 10 - 14|18 - 25 



in which m had a different meaning from (2.14). 
We define the resolvent of H through 



G 



G(z) := (H-zY 



and denote its entries by Gij(z). In the following sections we list our main results on the resolvent matrix 
entries. 

We conclude this section by introducing some notation that will be used throughout the paper. We use C 
to denote a generic large positive constant, which may depend on some fixed parameters and whose value may 
change from one expression to the next. Similarly, we use c to denote a generic small positive constant. For 
two positive quantities An and Bn we sometimes use the notation An ?< Bn to mean cAn ^ Bn ^ CAn- 
Moreover, we use An «C Bn to mean that there exists a constant c > such that ^ N~ c Bn\ we also use 
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An ~> Bn to denote Bn -C An- (Note that these latter conventions are nonstandard.) Finally, we introduce 
the Japanese bracket (x) := yl + |a;| 2 . Most quantities in this paper depend on the spectral parameter z, 
which we however mostly omit from the notation. 

For simplicity, here we state our main results assuming that d — 1 and that / satisfies the decay condition 



|/(ir)| sC C n (x)- n for all n eN. 



(2.17) 



Since d = 1, we have N = L and we shall consistently use N instead of L. Similarly, M x W, and we shall 
consistently use W in estimates. We also abbreviate T N = T. 

The generalization of our results to d > 1 and slowly decaying / is straightforward, and will be given in 
Section [8] We emphasize that the core of our argument, given in Sections [3j]5] is valid in general, independent 
of the dimension. 

2.2. Improved local semicircle law for resolvent entries and derealization. Throughout this section we 
assume d — 1 and (2.171. The Wigner semicircle law states that the normalized trace, -k TrG(z), is asymp- 
totically given by m(z). In fact, this asymptotics holds even for individual matrix entries. Our first theorem 
controls the (z-dependent) random variable 

A(z) := msx\G xy (z) - 5 xy m(z)\ . 



For the following we introduce the deterministic control parameter $ = & N \z) through 



4> 2 



1 



1 



(2.18) 



Theorem 2.2 (Improved local semicircle law). Assume d=l and (2.17). Suppose moreover that 

N < W 5/4 , 7] > N 2 /W 3 . 

Then we have 



(2.19) 
(2.20) 



A 2 -< $ 2 
for z € S. 

Clearly, the assumption r\ ^> N 2 /W 3 can be replaced with the stronger assumption 77 ^> W^ 1 ^ 2 . The 



assumption N W 5 ^ 4 is technical; to see why it is needed, see (6.3) in the proof of Theorem 2.2 below. In 
the regime (2.19), Theorem 2.2 improves the earlier result 

A 2 



-< 



1 

Mr] 



(2.21) 



proved in [23] (see Lemma 3.4 below). In fact, the estimate (2.20) is optimal, as may be seen from (2.38) 
and the first estimate of (2.30) below. By spectral decomposition of G one easily finds that 



iV 2 



1 

TV 2 



Tr G*G 



1 TrG 



Im to 
Nrj 



A 



Thus, in the regime where A is bounded, the average of |Gz y | 2 is of order (Nrj) 1 . Here we introduced the 
notation G*(z) := (G(z))* — (H — z)^ 1 , which we shall use throughout the following. 
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Remark 2.3. The bound (2.201 implies an estimate on the Stieltj es t ransform of the empirical spectral 
density, mjv(z) := iV _1 TrG(z). Under the assumptions of Theorem 



2.2 



and the conditions (2.19), we have 



mj^(z) — m(z) ~< $ 2 



(2.22) 



for z 6 S. Once A -< $ is established, (2.22) easily follows from 



^^QkG kk < <f 2 ; 



(2.23) 



mechanism (see Section 3.1). A concise proof of (2.23) can be found in |9l Theorem 4 



we leave the details to the reader. We remark that (2.23) is the simplest form of the fluctuation averaging 



For r) ^ (W/N) 2 we have $ 2 = (Nrj) 1 , and the bound (2.20) therefore shows that all off-diagonal 



entries of G have a magnitude comparable with the average of their magnitudes. We say that the resolvent is 
completely delocalized. Complete delocalization of the resolvent implies that the eigenvectors are completely 
dclocalized in a weak sense. The precise formulation is gi ven in Proposition |7. 1| below. By choosing n such 



that W 1 / 2 ^ r\ ^ (W/N) 2 and invoking Proposition 7.1 wc obtain the following corollary. 



Corollary 2.4 (Eigenvector delocalization). Assume d = 1 and (|2.17[). If N < W 5 ^ then the 



eigenvectors of H are completely delocalized in the sense of Proposition \7.1\ below 



sense; see Remark 



2.7 



This corollary imp roves the result in [6ll7j , where complete eigenvector delocalization (in a slightly weaker 



below) was proved under the condition N <C W 7 ^ 6 . It was observed in Section 11 of [7] 
that the graphical perturbative renormalization scheme of [ij|7j faces a fundamental barrier at N = W 6 ^. 
The reason for this barrier is that a large family of graphs whose contribution was subleading for N -C W 6 ^ 5 
in fact yield a leading-order contribution for N ^ W 6 ^ 5 if estimated individually. The cancellation mechanism 
among these subleading graphs has so far not been identified. As evidenced by Corollary |2.4| our present 
approach goes beyond this barrier. 



2.3. Diffusion profile. In this section we assume d — 1 and (2.17). In the previous section we saw that for 



rj ^ (W/N) 2 the profile of | | 2 is essentially flat. In the complementary regime, r/ ^ (W/N) 2 , the averaged 
resolvent EIG^^j 2 is determined by a non-constant deterministic profile given by the diffusion approximation 



®xy — 



| 2 5 



1 - |ml 2 5 



(2.24) 



Note that the matrix = (Q X y) solves the equation 
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\m\ 2 Se+ \m\ 2 S. 



which is obtained from (1.1) by dropping the error term 8. Clearly, <d xy is translation invariant, i.e. Q xy 



u o with u — [x — y]pf. Moreover, Q xy > for all x, y. Indeed, this follows immediately from the geometric 
series representation 



e*» = El 



l (S n ) X y 



(2.25) 



which converges by |m| < 1 (see (|3.6|) below) and the trivial bound ^ (S n ) xy ^ 1, as follows from (2.3) 
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The representation (2.25) in fact provides the following interpretation of Q xy in terms of random walks. 
From ( 3.5 ) below we find that \m\ 2 « e~ av (recall the definition of a from ( 1.3 )). Thus the right-hand side of 
(2.25) may be approximately written as Y2n>i e ~ nari {S n ) xy . By definition, S is a doubly stochastic matrix 
- the transition matrix of a random walk on T whose steps are of size W and whose transition probabilities 
are given by p(x —¥ y) — s xy . The normalized variance of each step is given by the unrenormalized diffusion 
constant 

' u 
W . 



D = D w := 



2 ^ 



(2.26) 



(We normalize by W 2 to account for the fact that the distribution s u q has variance 0{W 2 ).) It is easy to 
see that 



D = D„ 



0(W~ 



where 



Dm := 



1 



We conclude that is a superposition of random walks up to times of order (arj)^ 1 . In this superposition 
the random walk with n steps carries a weight |m| 2n 
are strongly suppressed. The total weight of Q u q is 

E e - = Ei 

u n>l 



x 2 f(x)dx. (2.27) 

irder (ary) -1 . In this superposition 
e~ nar i , so that walks with times larger than (arj) -1 

2n « (ar?)" 1 ; (2.28) 



a precise computation is given in (5.2) below. 

The following theorem shows that an averaged version of IG^^I 2 is asymptotically given by Q xy with high 
probability. The averaging can be done in two ways. First, we can take the expectation EIG^I 2 . In fact, 
taking partial expectation P x \G xy \ 2 is enough; here P x denotes partial expectation in the randomness of the 
x-th row of H (see Definition 3.2 below). Second, we can average in the index x (or y or both) on a scale of 
W; for simplicity we consider the weighted average 



Txy — ^ $xi | Giy | 



(2.29) 



Note that T is not symmetric, but our results also hold for T xy replaced with the quantities y*. s y j | G X j | 2 or 

s xiSyj\Gij\ . 



Theorem 2.5 (Diffusion profile). Assume d = 1 and ( |2T7| . Suppose that N < W 5 ^ 4 and (W/N) 2 < 
rj 1. Then 



\T X y Qxy] ~< , 

In addition, we have the upper bounds 
and 

where we defined 
T 



P x \G xy \ 2 - S xy \m\ 2 -\m\ 2 B a 



1 xy ^ ^ xy 



\(~*xy ^xy^^ll xy i 



-< 



l 



NT) y/W 



r (K) 

xy 



1 



1 



Nn Wy/rj 



exp 



WVD 



\x - v\n 



i / Vv\x-y\N 
w\ w 



-K 



(2.30) 

(2.31) 
(2.32) 

(2.33) 



Here K is an arbitrary, fixed, positive integer. All estimates are uniform in z £ S and x,y € T. 
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Note that the total mass of the distribution IG^o | 2 m ay De computed explicitly by spectral decomposition 



of G: assuming A -< ^ we have 



E^o = Ei^«i 2 = ImGo ° 



Imm 

V 



(l + O x (*)), 



(2.34) 



in agreement with the corresponding statement (2.281 for the deterministic limiting profile. 



Remark 2.6. We expect that (2.30) should in fact hold under the weaker conditions r\ ^> -h and N <IC W 2 . 
The improved local semicircle law (2.20) should also hold under these weaker conditions. In particular, this 
would imply complete derealization of the eigenvectors for all N -C W 2 . One obstacle is that a non-trivial 
control on A in the regime 17 ^ ^ is difficult to obtain. 

Remark 2.7. In Imm a diffusion approximation was proved for E| (e~ ltH ) X y\ 2 U P to times t <C W 1 ^ 3 ; this 
result was established only in a weak sense, i.e. by integrating against a test function in x — y, living on the 
diffusive scale Wt 1 / 2 . The formula 



H - E 



e -n{H-E-i n) dt 



(2.35) 



relates the resolvent with the unitary time evolution. Notice that the time integration can be truncated 
at t ^ to with to slightly larger than rf~ . Hence, controlling the resolvent whose spectral parameter has 
imaginary part greater than 77 is basically equivalent to controlling the unitary time evolution up to time 
Although it was not explicitly worked out in [ 6jl7|, the control on e~ ltH up t o t <C W 1 ^ 3 allows one 

(combined with Theorem 2.2 ) is thus stronger than 



2.5 



to control the resolvent for r\ 3> W 1 ^ 3 . Theorem 
the results of 6, 7 in the following three senses. 

(i) The resolvent is controlled for r\ > W^ 1 ^ 2 (instead of 77 ^> W^ 1 ^ 3 ). 

(ii) The control on the profile is pointwise in x and y (instead of in a weak sense on the scale Wrj^ 1 / 2 ). 

(iii) The estimates hold with high probability (instead of in expectation). 
However, the result in the current paper is not uniform in N, unlike that of |6jf7]. 



We conclude this section with an asymptotic result on the deterministic profile Q x o- Since we are 
interested in large values of x, we need to consider the small-momentum behaviour of the Fourier transform 
of O^o- Using the small-p expansion (1.2) and (1.4), we therefore find that <d x o ~ X , where we defined the 
A^-periodic function 



111 



V ? 



1 



a-q + W 2 Dp 2 2Wy/Dc^ 



^exp 



/at] 



WVD 



kN 



(2.36) 



here the second equality follows by Poisson summation and the Fourier transform J e lpx (l+p 2 ) 1 dp = 7re l^l . 
The following proposition, proved in Appendix [SJ gives the precise statement. 



Proposition 2.8 (Deterministic diffusion profile). Assume d = 1 and (2.17). For each K e N we 
have 



'x-y 



O 



1 

W 2 



O 



K 



1 / Vv\ x - 
w\ w 



v\n 



-20 



(2.37) 
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uniformly for x, y, and z G S with r\ <C 1. 
In particular, 



■>■!) 



(2.38) 



Moreover, if (W/N) 2 sC 77 1 and N ^ W 2 , we have the sharp upper bound <d xy CT xy 



Remark 2.9. The leading-order behaviour of ( 2.37| ) is given by (2.36). If 77 -C (VF/iV) 2 then is essentially 
a constant, i.e. the profile is flat. Conversely, if 77 ^> (W/N) 2 then the leading term on the right-hand side 
of (2.36) is given by the term k — (by periodicity of 9 we assume that x € T). This is an exponentially 
decaying profile on the scale \x\ ~ Wr\~ x l 2 . The shape of the profile is therefore nontrivial if and only if 
rj ^s> (W/N) 2 . Note that in both of the above regimes the error terms in (2.37) are negligible compared with 
the main term. 

The total mass of the profile J2 x gt® x ^ s gi ven by N times the term p = in the first sum of (2.36): 



OCT] 



Imm 

1 

V 



1 + 0(7?)), 



(2.39) 



where in the last step we used the elementary identities (3.3) and (3.5) below. In fact, the calculation (2.39) 



see ([^2] below. 



We conclude that the 



is a mere consistency check (to leading order) since J2 X 

average height of the profile is of order (Nrj)^ 1 . The peak of the exponential profile has height of order 
(W^/rj) -1 , which dominates over the average height if and only if 77 ^> (W/N) 2 . The regime 77 3> (W/N) 2 
corresponds to the regime where 77 is sufficiently large that the complete derealization has not taken place, 
and the profile is mostly concentrated in the region \x — y\ ^ Wrj^ 1 / 2 <C N. 

These scenarios are best understood in a dynamical picture in which 77 is decreased down from 1. The 
ensuing dynamics of corresponds to the diffusion approximation, where the quantum problem is replaced 
with a random walk of step-size of order W. On a configuration space consisting of N sites, such a random 
walk will reach an equilibrium beyond time scales (N/W) 2 . As observed in Remark 2.7 77 _1 plays the role of 
time t, so that in this dynamical picture equilibrium is reached for t ~ rj^ 1 3> (N/W) . Figure [l] illustrates 
this diffusive spreading of the profile for different values of r\. 



Remark 2.10. The VF-dependent quantity D in the definition (2.36) may be replaced with the constant 
Doo on the right-hand side of (2.36), at the expense of a multiplicative error (1 + 0(W^ 1 log W)) and 
an additive error 0(W~ 2 ). Indeed, the replacement D i-> in the prefactor is trivial by (2.27). In 
order to estimate the error arising from the replacement D 1— > in the exponent, we use the estimate 
e -£(i+0(i/w)) = g-?^ + o(x/W)) with £ ~ ^§\x -y + kN\. If £ < C \ogW then x/W CoW^logW, 
which is small enough. On the other hand, if £ ^ Co log W then e - ^ ^ W~ c ° , so that the resulting error is 
an additive error 0(W~ 2 ). 



2.4. 



(2.17) 



Derealization with a small mean-field component. 

We now consider a related model 



In this section we continue to assume d — 1 and 



H e = (1-e) 1/2 H + ^/eU , 



(2.40) 



where H is the band matrix from Section [2. 1[ U = (uij) is a standard Wigner matrix independent of H, and 
e ^ I is a small parameter. We assume that U has the same symmetry type as H, i.e. either (2.8) or (2.9). 
Its matrix entries are normalized such that Ett,-j = and E|u 



1_ 
N- 



Moreover, in analogy to (2.11), we 
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-N/2 



N/2 -N/2 



N/2 



Figure 1. A plot of the diffusion profile function at five different values of 77, where the argument x ranges over the 

torus T. Left: the graph x h-> r/9 x (see (2.391 for the choice of normalization). Right: the graph x 1 

\2\ 



chose N 



the graph x h 
25W and 77 = 5" 



r\B x 

for k = 172^4, 5. The cases k = 1, 2, 3 (where 77 > {W/Nf 



log 8 X . Here we 
are drawn using dashed 



lines, the case k = 4 (where r\ = (W/A) ) using solid lines, and the case k — 5 (where 77 < (W/N) ) using dotted 
lines. 



make the technical assumption that for each p there exists a constant [i v such that K\N 



1/2, 



^ fip for all 



E /i 



00 1 2 



We find 



N, i, and j. 

Let 5 e = (Si^) denote the matrix of variances of the entries of H E = (h) e /), i.e. 

5 e = (1 - e)S + see* , 

where we introduced the vector e := iV _1 / 2 (l, 1, . . . , 1) T . (Hence ee* is the matrix of the variances of U .) 
Clearly, ^ S e ^ 1 and S £ is a symmetric stochastic matrix satisfying (2.3) and (2.10). 

The effect of adding a small Wigner component of size e is that the imaginary part of the spectral 
parameter effectively increases from 77 to 77 + e in the local semicircle law and in the diffusion approximation. 
In particular, we can eliminate the condition N ^ W 5 ^ 4 and still obtain delocalization for H £ provided e is 
not too small. These results are summarized in the following theorem. In order to state it, we introduce the 
control parameter 



§t := max 



1 



1 



(2.41) 



which is analogous to $ defined in (2.18). 



Theorem 2.11 (Delocalization with small mean-field component). Assume d = 1 and (2.17). The 
following estimates hold uniformly for z G S. 

(i) Suppose that 77(77 + e) 3> W^ 1 . Moreover, suppose that N < W 5 / 4 orrj + e^W' 1 / 2 . Then 

(2.42) 



A 2 -< §1. 
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(ii) Suppose that e + rj 3> W and 



1 

W(e + 77) 

Then the resolvent is completely delocalized: 



N 



(2.43) 



A -< 



IVry 



7.1 



(«i) I/e S> (1V/I4 72 ) 2 / 3 i/ien ifte eigenvectors of H e are completely delocalized in the sense of Proposition 

This theorem formulates only the bounds concerning derealization, i.e. the counterparts of Theorem |2. 2 
Similarly to Theorem 2.5 a non-trivial profile can be proved for the average of IG^yj 2 . 



2.4 



and Corollary 

The profile is visible in the regime Nr) Wy/e + 77, and it is given by 

|m| 2 S e \ |m| 2 (l-e) [ y/(l-e)ar) + e 



1 



xy 



WJD({l-s)ar 1 + s) 



: exp 



WVD 



\X - 7/| 



(2.44) 



where the approximation is valid in the regime \x — y\ <C N. The details of the precise formulation and the 
proof are left to the reader. 



3. Preliminaries 



In this subsection we introduce some further notations and collect some basic facts that will be used through- 
out the paper. Throughout this section we work in the general d-dimensional setting of Section [2.1| 

Definition 3.1 (Minors). For T C {1, ...,1V} we define by 

(H^y := l(i 4- T)l-(j £ T)hij . 

Moreover, we define the resolvent of through 



GTAz) := (H 



We also set 

(T) 

E : = E • 

When T = {a}, we abbreviate ({a}) by (a) in the above definitions; similarly, we write (ab) instead of 
({a, b}). Unless specified otherwise, summations are over the set {1, 2, . . . , IV}. Recalling the identification 



o/T^ with {1, 2, . . . , IV}, this convention is in agreement with the one given after (2.3) 
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Definition 3.2 (Partial expectation and independence). Let X = X{H) be a random variable. For 
i£ {1, . . . , N} define the operations Pi and Qi through 

P t X := E(X\H®), QiX := X - P t X . 

We call Pi partial expectation in the index i. Moreover, we say that X is independent of T C {1, . . . , N} if 
X = PiX for all i e T. 

The following lemma collects basic algebraic properties of stochastic domination -<. We shall use it 
tacitly throughout Sections [4]-[7] Roughly it says that, under some weak restrictions, -< satisfies all the usual 
algebraic properties of ^ on R. 

Lemma 3.3. (i) Suppose that X(u,v) -< ^S(u, v) uniformly in u € U and v € V. If \V\ N for some 
constant C then 

^X{u,v) -< ^ *(«,«) 

vev v£V 

uniformly in u. 

(ii) Suppose that X\(u) -< ^\{u) uniformly in u and X 2 (u) -< uniformly in u. Then 

X 1 {u)X 2 (u) -< *i(u)* 2 (u) 

uniformly in u. 

(Hi) Suppose that ty(u) N for all u and that for all p there is a constant C p such that K\X(u)\ p N Cp 
for all u. Then, provided that X{u) -< ty(u) uniformly in u, we have 

P a X(u) < *(u) and Q a X(u) -< 9(u) 

uniformly in u and a. 

Proof. The claims (i) and (ii) follow from a simple union bound. The claim (iii) follows from Chebyshev's 
inequality, using a high-moment estimate combined with Jensen's inequality for partial expectation. We 
omit the details. □ 

Note that if for any e > and p ^ 1 we have 

E\X\ P s? N e V p (3.1) 

for large enough TV (depending on e and p) then X -< ^> by Chebyshev's inequality. Moreover, if X ^ \& 
almost surely, then X -< "J. Hence O^('J') describes a larger class of random variables than 0(*ff). 
We need the following bound on A. 

Lemma 3.4. We have 

AW < -L= (3.2) 

uniformly for z G S . 
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Away from the spectral edges, i.e. for z £ S, this bound was proved in Proposition 3.3 of 23 .In 23 , the 
matrix entries Xij were assumed to have at most subexponential tails (a stronger assumption than (2.11 1 for 
all p), but the proof of [23] extends trivially to our case. See [9] for a simplified and generalized alternative 
proof. 

The following result collects some elementary facts about m. 



Lemma 3.5. We have the identity 

There is a constant c > such that 
on z g S. Furthermore, 



1 - \mV = 



rj\m\ 



Imm 

c |m| s$ 1 

= 1 - r]a + 0(r] 2 ) 



for z £ S, with a given in (1.3 1. We also have the bounds 

Imm x 1, 1 — |m| 2 x r\ 
for z € S. (The implicit constants in the two latter estimates depend on k.) 



(3.3) 
(3.4) 
(3.5) 

(3.6) 



PROOF. The identity (3.3) follows by taking the imaginary part of ( 2.15| ). The estimate (3.4 1 was proved 
25 , Lemma 4.2. From (2.161 we find Imm = 1/a + 0(n), from which (3.51 follows easily using ( 3.3 1. 

□ 



Finally, (3.6) follows from Lemma 4.2 in 25 combined with (3.3) and (3.4). 



The following resolvent identities form the backbone of all of our proofs. They first appeared in 23 
Lemmas 4.1 and 4.2] and 11 Lemma 6.10]. The idea behind them is that a resolvent entry Gy depends 
strongly on the i-th and j-th columns of H, but weakly on all other columns. The first set of identities 
(called Family A) determine how to make a resolvent entry Gij independent of an additional index k ^ i,j. 
The second set (Family B) identities express the dependence of a resolvent entry Gij on the entries in the 
i-th or in the j-th column of H. 



Lemma 3.6 (Resolvent identities; (8j Lemma 3.5]). For any Hermitian matrix H and T c {1, 
the following identities hold. 



Family A. For i,j 7 k ^ T and k ^ i, j, we have 



G 



(T) 



G 



(Tk) 



r (T) r (T) 
(j l k U kj 



G 



(T) 
kk 



Family B. For i,j^T satisfying i ^ j we have 

(Ti) 



r (T) _ r (T) 
U ij — 



h ^G 



G 



(Ti) 



(T) 



G 



(Tk) 



r (T) r (T) 

^ik U ki 
r (T) r (Tk) r (T) 
"ti ^ii ^kk 



.,N} 



(3.7) 



(Tj) 



k 



Definition 3.7. The deterministic control parameter ^ is admissible if 

M-V2 <; yW( z ) ^ M -~</ 2 



(3.8) 



(3.9) 



for all N and z€ S. (Recall the parameter 7 from (2.13).) 



16 



A typical example of an admissible control parameter is 

1 

71?^ 



If ^ is admissible then the lower bound in (3.9| together with (2.12) ensure that h%j -< \&. 
The following lemma gives an expansion formula for the diagonal entries of G. 

Lemma 3.8. Suppose that A -< $ for some admissible ^ . Defining 

(i) 



k,l 



(3.10) 



Gi 



+ m 2 Zi + (* 2 + M~ 1/2 ) 



(3.11) 



PROOF. The claim is an immediate consequence of Equations (9.1) and (9.2) in j8j. (Related but less explicit 
formulas were also obtained in 24 ). □ 

3.1. Averaging of fluctuations. In this section we collect the necessary results from (8). The following 
proposition is a special case of the Fluctuation Averaging Theorem of [8] . 

Proposition 3.9. Suppose that A -< ^> for some admissible control parameter $ . Then 



(3.12) 



and 



(*0 



^ Spa.QaiGfj.aG an) < ^ , ^ S pa Q a (G^G^) ~< ^(tf + M" 1 



-l/4\ 



(3.13) 



as well as 



^s pa s ab G ba G aA1 G; b ~< * 2 (* + Af- 1 / 4 ) 

a^b 



Y, s p- s baG ba G af jG; b x * 2 (* + M- 1 / 4 )' 

a^b 



(3.14) 



PROOF. All of these estimates follow immediately from Theorem 4.8, Lemma B.l, and Proposition B.2 of [8], 
recalling that by assumption Imrn c K by (3.6 1. □ 



The important quantity on the right-hand sides of (3.12), (3.13) and (3.14) is ^. The additional factors 
M -1 / 4 are a technical nuisance, but their precise form will play some role in the large- 77 regime, where 
Af-i/ 4 i s not negligible compared to 'J. 

To interpret these estimates, we note that each summand in (3.12), (3.13), and (3.14) has a naive size 
given by \t , where k is the number of off-diagonal resolvent entries in the summand. Without averaging, this 
naive size would be a sharp upper bound. In the second estimate in (3.12) the averaging does not improve 
the bound since G^ a G* atl = IG^al 2 is positive. In all other estimates, the monomial on the left-hand side 

- 1 / 4 . See [i] Remark 4.10]. 



1 This nuisance is necessary, however, and Proposition 



3.9 



would be false without the factors of M 
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cither has a nontrivial phase or its expectation is zero thanks to Q a . Proposition 3.9 asserts that in these 
cases the averaged quantity is smaller than its individual summands. Note that this averaging of fluctuations 
is effective even though the entries of G may be strongly correlated. How many additional factors of *f? one 
gains depends on the structure of the left-hand side in a subtle way; see Theorem 4.8 of [8] for the precise 



statement. For the applications in this paper the second bound in (3.13) is especially important; here the 
averaging yields a gain of two extra factors of 

We remark that all these bounds also hold if the weight functions s pa are replaced with a more general 
weight function. The precise definition is given in Definition 4.4 of |8j. All the weights used in this paper 
satisfy Definition 4.4 of (8). 

We also note that averaging in indices can be replaced by expectations. We shall need the following 
special case of Theorem 4.15 of (|. 

Proposition 3.10. Suppose that A -< $ for some admissible control parameter . Then for s//j,i/ 

P a {G pa G au ) -< *(* + M- 1 / 4 ) 2 . (3.15) 



4. Self-consistent equation for T 



After these preparations, we now move on to the main arguments of this paper. Throughout this section we 



work in the general d-dimensional setting of Section 2.1 In this section we derive a self-consistent equation 
for T, given in Theorem |4.1[ whose error terms are controlled precisely using the fluctuation averaging from 
Proposition |3.9| In Section [5] we solve this self-consistent equation; the result is given in Proposition |5.1| 

Theorem 4.1. Suppose that A -< '5 for some admissible control parameter ^> . Then we have 



(4.1) 



In matrix notation, 



\m\ 2 S- 



T = \m\ 2 ST- 
where the matrix entries of the error satisfy 



(4.2) 



(4.3) 



The naive size of T xy is of order 'J' 2 . Notice that the error term in the self-consistent equation (|4.2|) is 



smaller by two orders. This improvement is essentially due to second estimate of (3.13) 



Remark 4.2. Instead of averaging in the first index of the resolvent in the definition of T (|2.29|) , we could have 



averaged in the second, resulting in the quantity T' xy := Y]j\G x j\ Sj v . Then T' satisfies the self-consistent 
equation 



T' = \m\ 2 T'S- 



2 S- 



■s', 



where £' also satisfies (4.3). 
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Before the proof we mention that this result also gives a self-consistent equation for the two-sided averaged 
quantity 

Y xz — (TS^) XZ ^ ^ Sxi | Gj y I Sy Z — (ST ^) X z • 

iy 



Taking the average J2 y s yz 01 (4.1 1, we get the following corollary. 

COROLLARY 4.3. Suppose that A -< for some admissible control parameter ^ . Then we have 
Y = \m\ 2 SY + \m\ 2 S 2 + £ and Y = \m\ 2 YS + \m\ 2 S 2 + £ , 



(4.4) 



where £ and £ each satisfy (4.3 I. 

The rest of this section is devoted to the proof of Theorem |4.1| We begin by writing 

^-xy ^ ^ S x iPi | G{y | -\- T X y , ^xy • ^ ^ ^Z'iQil^iyl 



Then by the second formula in (3.13), we have 

f xy = CL^* 4 + * 2 M" 1/2 ) . 



(4.5) 



(4.6) 



Notice that (3.13) applies only to the summands i^y'm (4.5). The estimate for the summand i = y follows 
from 

SxyQy\G vy \ 2 = s xy Q y \Gyy ~ m\ 2 + 2s xy RerhQy(Gyy - m) = 0^(^ 2 M~ 1/2 ) , 

where we used that (G yy —m) ~< ^ (see (3.11 )) and that ^ is admissible, and in particular 'I'M -1 ^ , i ,2 M~ 1/ ' 2 . 
We shall compute J^i s xiPi\Gi y \ 2 up to error terms of order "J/ 4 . We have the following result. 

Lemma 4.4. Suppose that A -< for some admissible control parameter ty. Then 

P^yl 2 = ImfP^y + lrnfdty + O^ + ^M-^+StyO^ + M-V 2 ) (4.7) 

and 



J2 s ^\G*y\ 2 = \m\ 2 J2 s ^y + \ m \ 2 ^y + 0^(^ + ^ 2 M-^ 2 ) 



(4.8) 



(It is possible to improve the last error term in (4.7), but we shall not need this.) Before proving Lemma 



4.4| we show how it implies Theorem 4.1 



PROOF of Theorem 14.11 Equation (4.1) is an immediate consequence of (4.8), (4.6), and (4.5). Hence 
Theorem I4T] follows from Lemma [O] □ 



Proof of Lemma 14.41 Throughout the following we shall repeatedly need the simple estimate 

1 _ 1 1 

Gu 



(1 + 0^)) = 0^(1) 



(4.9) 



where the first step follows from A -< '5, the second from the fact that VE" is admissible, and the last from 
(3.4). In particular, for k ^ i,j, from (3.7) we get the estimate 

GikGkj 



G 



(fc) 



Gi 



G 



(4.10) 
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We start the proof of Lemma 4.4 with the case i ^ y. Using (3.11 ) we get 



\G iy \ 2 - | Gu 1 2 h ik G£ v G* t * h, 



k.l 



(0 



1 + mZi + (* 2 + M- 1 / 2 ) Y, h ik G ky G yl h ' 



k.l 



(0 



(1 + 2ReK)) X! ^ G iy G i?*^ + 0^(* 4 + * 2 M-V2) . 



A'./ 



where in the last step we used Zi ~< and the large deviation bound (see Lemma B.2) 



(4.11) 



(4.12) 



We may now compute the contribution of the main term in (4.11) to Pi\Gi y \ . Still assuming i ^ y, we find 



(0 (0 



fc./ 



fc 

(i) 

fc 

(0 



fey 



GklGiy 



Gi 



E 



|Gfc y | — 2Re Gk y 



I ^ GhiG 



ki^iy 

Gu 



(0 



Ti y — 2Re SikGk 



-^=i GkiGiy 



ky " 



G, 



CL.(* 4 + * 2 M _1 ) 



(4.13) 



In the third step we used (4.9), and in the last step we added the missing term k = i to obtain T iy ; the 
resulting error term is (W^M _1 ) since i^ y. Next, using (4.9) we get 



^ ' SikGky 
fc 



Gfcz 



(0 



r . = -Y Si kG kl G iy G; k + 0^ 4 ) 
Gu m k 

1 (i) 

= -Y, 3ikGkiGi v G vk +°^(* 4 )' 



(4.14) 



In the second step, using (3.7) and (4.9), we inserted an upper index i as a preparation to taking the partial 
expectation Pj. We obtain 

(0 W j 

^E /lifeG 2 G S ) * /l « = r il/ -2Re^^ fc -G fei G^Gg*+O x (* 4 + * 2 M- 1 ). (4.15) 



fc,; 
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Now we take the partial expectation in i in ( 4.15[ ). Using that 

Pi{G kl G ty Gf k *) = G y f Pi(G ki G iv ) < (Syk + ^^ + M- 1 / 4 ) 2 

by Proposition 3.10| and (4.10), we find that Pi applied to the second term in (4.15) results in a quantity 
0^(* 2 (* + M 1/4 ) 2 ). Thus the contribution of main term in ( |4.11[ ) to Pi\G iy \ 2 is 



(4.16) 



k.l 



Next, we look at the contribution of the term with a Z in (4.11 1: 



\m\ 2 P l 



(mZi + mZ>) ]T htkG^G^hu 



k.l 



\m\ 2 P r 



(<) 



k,l 

O x (# 2 M -1 + Af -2 ) 



: £ s ik s u (mG^G^.G^* + mG«*G«G W * 



Ik ky yl 



Ik ky 



k.l 



(0 



= |m| 2 2Re( SlfcS ,G«G«G(f ) + O^M' 1 + M~ 2 ) 

k,l 



In the second step we assumed for simplicity that we are dealing with the complex Hermitian case (2.9); 
thus, thanks to Qi, the only allowed pairing of the entries of H imposes c = I and k — d. (In the real 
symmetric case (2.8), there is one other term, (4.18) below, which is estimated in the same way.) The error 
terms stem from the summands c = d = l = k=/=y and c = d = l = k = y, respectively. Here we used (4.10) 
as well as the bound E|/i 4 c | ^ CM~ 1 Si C ^ CM~ 2 , an immediate consequence of (2.11) and (2.12). In the 
last step we switched the indices k and I. 

Now we remove the upper indices at the expense of an error of size 0_<(\& ), and then add back the 
exceptional summation index i as before. This gives 



PA 2Re (mZ^^hikG^G^hu J = |m| 2 2Re[m^ s lk 

^ k,l ' k.l 

= o^(^ A + ^ 2 Ar 1/2 ) , 



~suGi k G ky G* 



0^(* 4 + * 2 M _1 + AT 2 ) 



(4.17) 



where in the second step we used (3.14); the various cases of coinciding indices k,l,y are easily dealt with 
using the bound Af~ 1/2 ^ 

As remarked above, in the real symmetric case (2.8) the pairing c = k, d = I is also possible. This gives 
rise to the additional error term 



Y,s ik s H G kl G ky G* yl ~< * 4 + * 2 M-V 2 



(4.18) 



k.l 
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where we used (3.14). 



Combining (4.11 ), (4.16) and (4.17) yields 



Pi\G; 



(4.19) 



for i =^ y. This proves (4.7) for the case i ^ y. 



If i = y we compute 



P y \G yy \ 2 = \m\ 2 + P v \G vv -m| 2 



yl^yy 



2P y Re [m(G yy - m)] = |m| 2 + 0^ (* 2 + Af~ 1/2 ) . (4.20) 



Here we used that G yy — m ~< W and that P y (G yy — m) ~< ^ 2 + M -1 / 2 by ( |3.11[ ). It is possible to compute 
this term to high order in iff, but we shall not need this. 

For the proof of (4.8 1 we run almost the same argument as above but now we aim at removing all upper 
indices i. We first consider the summands i j^y. From (4.15) we get 



(i) 



P i f^h lk G^G y fh H = T ty -2ReJ2^k-G kl G iy G; k + 0^(y 4 + V 2 M- 1 / 2 ), 



(4.21) 



k.l 



where we removed the upper index i using (3.7), and included the summand k — i at the expense of a 
negligible error term. Taking the average s x j of the second term on the right-hand side yields 

(v) 

J2I2 s * iSik m GkiGivG *y k = Y, s ™ Sik m GkiGiyG *y k + 0<{ ^ 2M ~ l + M ~^ = °^(* 4 + * 2m ~ 1/2 )- 

i k i,k 

(4.22) 

In the first step we just added the exceptional index i — y, and estimated the additional terms with i = y 
using s xy s yk ^ M~ 1 s y k ^ M -2 as well as G k yG yy G* yk < 5 ky + & 2 - In the second step we used ( 3.14[ ). Note 
that the gain comes from the summation index i. 

Thus the contribution of the main term of (4.11 1 to Yli s xiPi\Gi y \ 2 is 



(v) (0 

|2 \ " „ D \ " . ^(iWi) 



GO 



^s^PiJ^hikG^fhu = Imf^SxiTiy + O^ + ^M-V 2 ). 



(4.23) 



k-.i 



The contributions of the error terms in < |4.11 1 to Y^ijty s xiPi\Gi y \ 2 are of order 0^( , 3> 4 + ty 2 M 1 / 2 ); this is 
true even without averaging (see (4.17)). Thus we have 



(y) (y) 



(4.24) 



Finally, we consider the case i = y. From ( 4.20[ ) we get 



s xyPy\G yy \ 



1 s xy + 0^(M- 1 ^ 2 ) = \m\ 2 s xy + 0^). 



This formula provides the missing summands i = y in ( |4.24 ) and hence yields (4.8). 



□ 
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5. Solving the equation for T 



Throughout this section we work in the general d-dimensional setting of Section |2.1| In this section we 
solve the self-consistent equation (4.2 1 to determine T. This involves inverting the matrix 1 — |m| S. The 
stability of the self-consistent equation (4.2 1 is provided by the spectral gap of S. In the regime of complete 
derealization, this gap is larger (and hence more effective) if we rest rict S to the subspace orthogonal to 
the vector e = iV _1 / 2 (l, 1, . . . , 1) T (see the remarks after Lemma 5.2 for more details). Therefore, we deal 
with the span of e and its orthogonal complement separately. Define the rank-one projection 

n := ee* . 



Thus, the entries ILj of II are all equal to 1/N, and SH = IIS = LT since S is stochastic by (2.3). The 
complementary projection is denoted by LT := 1 — II. 

We perform this splitting on T xy only in the x coordinate, regarding y as fixed. Thus, we split 



Tr 



Ty + (T x 



where we defined the averaged vector 



1 

N ^ 



\G iy 



here the last step follows easily by spectral decomposition of G. We can use the local semicircle law, 
Lemma [3^41 to get 



T„ 



Im m 

Nrj 



1 + 0- 



1 



iMr\ 



It is instructive to perform the same averaging with the deterministic profile 0: 



n- 



\m\ 2 S 



\ 2 S 



1 



N 1 - |m| 2 



Imm 

Nrj 



(5.1) 



(5.2) 



where in the last step we used (3.3 1. 

Having dealt with the component HT in (5.1), we devote the rest of this section to the component HT. 
The following proposition contains the main result of this section. 

Proposition 5.1. Suppose that A -< "J for some admissible control parameter ^ . Then we have for all y 

S-U 



T 



1 - IrnPS 1 



xy 



where the error satisfies 



max \£ X y\ -< 
x,y 



(5.3) 



(5.4) 



V 



The main tool in the proof of Proposition 5.1 is a control on the spectral gap of S on the space orthogonal 
to e. In order to state it, we need the Euclidean matrix norm ||^4|j as well as the l°° — > i°° norm of the 
matrix A, 



\A\\ 



max 2 \Ai 
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The following lemma shows that S has a spectral gap of order (W/L) 2 when restricted to the space orthogonal 
to e. Its proof is postponed to the end of this section. 



Lemma 5.2. We have the bounds 



and 



1 n 



l 2 s 



1 n 



1 - \m\ 2 S 



C 



»>+(£) 

Clog AT 
^+(f) 2 



(5.5) 



(5.6) 



In the regime of complete delocalization, ij ^ (W/L) 2 , the control on (1 — Imps') 1 is stronger on the 
space orthogonal to e. Indeed, in that regime the bound (5.5) is better than the trivial bound 



1 - |m| 2 S 



1 



C 



(5.7) 



from (3.6 1 . 



PROOF of Proposition 15. 11 Multiplying ( 4.2 ) by II from the left yields 

ut = \ m \ 2 sTiT + H 2 (s-n) + ns, 

where we used that SII = IIS = II. Therefore 

nr = |m 



2 s - u + e. 



i 



m| 2 S 



£ := 



1 



1 - |m| 2 S 



-us. 



Note that (JlT) xy = T xy — T y . Using (5.6) we therefore get (5.3) whose error term satisfies 



max\£ xy 



x,y 



1 



|m| 2 S 



n 



max|f xs/ | -< 



x,y 



(* 4 + ^M- 1 ' 2 ) 



This completes the proof of (5.3) and (5.4). 



□ 



Next, we estimate \Gij — 5ijin\ 2 in terms of Ty. In other words, we derive pointwise estimates on 
from estimates on the averaged quantity T xy . This gives rise to an improved bound on A, which we may 
plug back into Proposition |5.1| Thus we get a self- improving scheme which may be iterated. 



Lemma 5.3. Suppose that A -< \1/ with some admissible control parameter ^ and Tij -< f2?- for a family of 



admissible control parameters indexed by a pair (see Definition 3.1). Then 



\Gij - Sijm\ 



'k s ki 



(5.8) 



(Here we write f2 2 - := (VLij) 2 .) 
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Proof. We fix the index j throughout the proof. Let first i ^ j. Then (3.8) gives 



Gij — Ga ^ ' hikG k 



(i) 
kj 



(5.9) 



We shall use the large deviation bounds from Theorem |B.1| to estimate the sum. For that we shall need a 
bound on 



(0 



^ij &ii | I 



C^(* 4 ) -< ^ + * 4 , (5.10) 



where in the first step we used (3.7) and (4.9). Since Ga -< 1, we get from (5.9) and Theorem B.l (i) that 



\g 13 \ 2 ~< + 



To estimate Ga — m, we use (3.11) to get 



\G lt -m\ 2 s? C\Zi\ 2 + 0^ 4 + M~ 1 ) ~< ^n^H + t^ + M- 1 . 



(5.11) 



Here we used 



\Z,\ l < 



(i) 2 

where in the second step we used Theorem B.l (i) and (ii), with the bounds ~> 1 and 



This last estimate follows along the lines of (5.10), whereby the error terms resulting from the removal of 
the upper indices are estimated by Cauchy-Schwarz; we omit the details. Finally M -1 can be absorbed into 
J2k ^ik s ki by admissibility of Oy. □ 

We may now combine Proposition |5 . 1| and Lemma |5.3| in an iterative self-improving scheme, which results 
in an improved bound on A. 

COROLLARY 5.4. Suppose that A -< ^ and -< fl 2 for all i and j, where \& and fl are admissible control 
parameters. Then 

A 2 -< il 2 . (5.12) 

Proof. We apply Lemma [5. 3| t o the constant control parameter = fl for each i,j. Thus, suppose that 
Tij -< fl 2 for all Lemma |5.3| yields 

A 2 -< ty 2 =^> A 2 ~< n 2 + * 4 . 

Now we can iterate this estimate, ft 2 + 'J 4 taking the role of \I/ 2 in controlling A 2 . Thus after one iteration 
we get 

a 2 -< n 2 + (n 2 + ^ 4 ) 2 -< fi 2 + * 8 . 

After k iterations we get A 2 -< Q 2 + ^ 2 . Since fl and ^> are admissible, we have \E' 2 -< fl 2 for k ~ | log7|. 
This completes the proof. □ 
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What remains is the proof of Lemma 5.2 which relies on Fourier transformation. We introduce the dual 

2tt 



lattice of T = T d L , 



P = Pt := 



L 



■ T 



L ■ 



For p e R d define 



Z L ,w \W 



(5.13) 



In particular, if p G P then S(p) is the discrete Fourier transform of s x q. Since s xy is translation invariant 
and L-periodic, we get for all i,i/£T that 



[x-y] L jy 



p£P 



Proof of Lemma 15.21 First we show that for large enough L the Euclidean matrix norm satisfies 

2 



|sn|| < i-ci 



(5.14) 



with some positive constant c\ depending on the profile /. Since the matrix entries Sij are translation 
invariant (see (2.2 1), it is sufficient to compute its Fourier transform as defined in (5.13). Using the property 
Su(p) — S(p)u(p), the fact that LT(p) = 5 p q, and Plancherel's identity, we find 



|STI|| = max{ \S(p)\ :p € P\{0}} ^ 1 - c x 



(5.15) 



The last step follows easily from S(p) ^ — 1 + S (recall (2.4)) and the representation 

x£T ■ x 7 x£T ' x ' 

where in the second step we used \p ■ x\ 7T, and in the last step \p\ 2n/L. 
From (3.6) we get 1 — \m\ 2 crj, which, combined with (5.15), yields 



> Ci 



isnii sc i- 



iT' 



(5.16) 



Thus we get 



1 - \m\ 2 S 



n 



J2\m\ 2k \\SU\\ k 



C 



fc=0 



2 ■ 



This is (5.5 



In order to prove (5.6), we first observe that H^Hoo ^ 1 as follows from the estimate 



max |(iSV)j| ^ max 



< max liv 
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where v = (i>j) is an arbitrary vector. Thus, for any vector v satisfying (v, e) = any ko £ N we have 



1 - \m\ 2 S 



fc -l 



EH 2fe ||5 fc v|L+E 



< 2^ l TO 

00 fc=0 



| 2fe ||5 fc v| 



k— ko 
|2fc|| CTTllfel 



< *d||v||oo + X) l m | 2fc H Sn l| fe |l V ll2 

k— k{\ 

l 2fc nl| ( 5JJ||*o 



< *0 V 



ml 



1 - |m| 2 ||SH|| 



where we used the bound HvH^ ||v|| 2 < V^V|| v|| ^ and ( pT4| ). Choosing k = C(l og jV )[??+ (T^/L) 2 ]' 1 with 
a sufficiently large constant C, we obtain (5.6) exactly as above using the bound (5.16). This completes the 
proof of Lemma 5.2 □ 



6. Delocalization bounds 



In this section we prove our main results - Theorems |2.2| |2.5[ and |2.11| We return to the one-dimensional 
case, d = 1, and continue to assume (2.171. In particular, we write N instead of L. The simple extension to 
higher dimensions is given in Section[8j 



6.1. Delocalization without profile: proof of Theorem 



2.2 



parameter Then ( |5\3| together with ( pT4| ), ( |5Tj ), ancH |2.38| ) yield 
1 



1 



1 



Recalling Corollary |5.4[ we have therefore proved 



(* 4 + y 2 w- 1/2 ) -< $ 2 



Suppose that A -< \1/ for some admissible control 
(* 4 + ^W' 1 ' 2 ) . 



(6.1) 



A -< f 



N 2 



A 2 -<; $ 2 + — (* 4 + * 2 VF" 1/2 ) 
W 2 y ' 



(6.2) 



i.e. the upper bound A 2 -< ty 2 can be replaced with the stronger bound (6.2) 



We can now iterate (|6-2j), exactly as in the proof of Corollary 5.4 We start the iteration with ^0 
(Wr/)~ 1 / 2 ; see Lemma 3.4 Explicitly, the iteration reads 



fc+i 



:= & 



W 2 ^ 



-l/2\ 



From (6.2 1 and Lemma 3.4 we get that A 2 -< for any fixed k. 

1 r 

r *o < 1 and 



In order perform the iteration, we require 

TV 2 



iV 2 



W~ 1/2 < 1. 



(6.3) 



W 2 u W 2 
Thus we get the conditions N <C W 5 ^ 4 and 77 ^> N 2 /W 3 . (Here we used (2.1)). Satisfying these two 



conditions is the reason we need to impose the restriction on W in Theorem |2.2[ Corollary [2741 and Theorem 
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2.5 Using (6.3) and the fact that $ is by definition admissible, it is now easy to see that there is a finite 



constant k, which depends on the implicit constants c in -C and ^> above, such that 'J' 2 , ^ C$ . This 
concludes the proof of Theorem |2.2| 



6.2. Derealization with profile: proof of Theorem 2.5 By assumption we have (W/N) 2 ^ rj ^ 1, so that in 
particular <£> 2 = W^ 1 ^ 1 / 2 —: ^ 2 . Note that this ^> is admissible. From (2.20) we get A ~< Now observe 
that Imm 



N-q 



n xy is^ for all a; and y, as well as 



Imi 



-n 



s-u 



s 



i 2 \S 



1 - \m 2 \S 



by (|3.3[) and the property ITS' = STI = II. Thus ([5J3 ) together with (|5_4j) and ( 
of (2.30), since in the regime r\ ^ (W/N) 2 and W b/4 ^> N the error term (5.4 



5.1 ) implies the first estimate 
) is bounded by 



y2 w -l/2\ ^ 



V 



c 

Nr) 



The second estimate of (2.30) follows from the first one and (4.7). 
Next, ( |2.31[ ) follows by using \2M\ in ( |2.30 ). 

Finally, using Lemma 5.3 with f2^- = Ty and >I> := VF" 1 / 2 ^ -1 / 4 , we obtain 



Sijm\ 



(6.4) 



Here we used that "J 4 can be absorbed into (Nrj) 1 ^ Tjj and in the last summation TjfcSfej can be 



absorbed into T,, ^ ^p^ . This proves (2.32), and hence concludes the proof of Theorem 



2.5 



6.3. Derealization with a small mean-field component: proof of Theorem 2.11 In order to prove Theorem 
|2.11| wc repeat the arguments from the previous sections almost to the letter. The self-consistent equations 
from Theorem 4.1 remains unchanged except that S £ replaces S in (4.2). The key observation is that, on 
the subspace orthogonal to e, the lower bound on 1 — |mpS £ is better than that on 1 — |m| 2 5. Indeed, using 
(3.6) we get 

(l-\m\ 2 S e )U = 1-H 2 (1-£)STT ^ 1-(1-ct/)(1-£)STT ^ c(r) + s) 



with some positive constant c. This implies that (5.5) and (|5.6|) can be improved to 

C 



1 



1 



Imps'. 



n 



v 



■ e 



+ (fr 



i 



1 - \m\ 2 S e 



n 



ClogiV 



■ e ■ 



'W\ 2 
. N ) 



(6.5) 



Suppose now that A -< 'J for some admissible control parameter 'F Then the statement of Proposition 
15. II is modified to 

„|2 / 



T 



S e -IL 
1 - \m\ 2 S £ 



(6.6) 



XtJ 



where the error term satisfies 



max \£ xy 

x,y 



-< 



w\ 2 



i^ + ^w- 1 / 2 ). 



(6.7) 
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Notice that the Fourier transforms of S and S e (defined by (5.13)) satisfy 

S £ (p) = (l-e)S(p)+e5 pQ . 



(6.8) 



Thus we have 



1 - \m\ 2 S E 



1 E 



e ip(x-y) 



(1 - e)S(p) 



peP.p^o 



J_ V" e ip(»-») (i - g)>g(p) 
^ i-H 2 (i-£). 



Imm 

l-|m| 2 (l-£)5(]5) H^7 
Imm 







\m\ 2 (l-e)S{p) \m\ 2 Nrj Vfa + e)^ 



(6.9) 



Here we treated the zero mode p = separately; it is given by 



-T.(- 

AT ^\ 1 



m 2 5 E 



n- 



1 - m 2 & 



AT(1 - |m| 2 



Imm 

\m\ 2 Nr] ' 



where in the last step we used (3.3). The error term in (6.9) is estimated using a similar calculation. 

Notice that the coefficient of S(p) in the denominator of (6.9) is now |m| 2 (l — e) = 1 — e— (1 — e)arj+0(r) 2 ), 
where we used (3.5). The results and the proof of Proposition 2.8 remain unchanged when S is replaced 
with S £ , except that ar\ must be replaced with (1 — e)arj + e on the right-hand side of (2.36 ), and the whole 
expression is multiplied by an additional factor (1 — e). Moreover, instead of (2.38), we now have 



max 

xy 



S, 



1 - |m| 2 5 £ 



1 



Nrj Wy/rf+e 



(6.10) 



Recall the definition (2.20) of $ e . Following the proof of Theorem 2.2 instead of (6.2) we now obtain 

1 



A -< $i 



Tj + e - 



> N i 



-(* 4 + * 2 w-v 2 ) 



(6.11) 



As in Section 6.1 we can iterate (6.11) under the conditions 

2 



1 



W~ 1/2 < T] + e + 



(6.12) 



(Note t hat t he a priori estimate (Wrj) 1 is still determined by W despite the small mean-field component. In 
Lemma 3.4 it is given by (Mr])" 1 / 2 where M = (max tJ s^)" 1 ~ (eN^ 1 + W~ 1 )~ 1 ~ W .) The first condition 
of ( |6.12| ) holds if 



and the second holds if either 



r]{r] + e) > W' 1 , 
r, + e > W~ 1/2 
N < W 5/i . 



(6.13) 
(6.14) 
(6.15) 



This concludes the proof of part (i). 
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In order to get complete derealization of the resolvent, i.e. A 2 -< (Nrj) 1 , we require A -< $ 2 as well as 

ITV^Te ^ At?, (6.16) 

which ensures that $ e = (Nr])^ 1 . Hence we get complete derealization of the resolvent provided that ( 6.13| ), 
(6.14), and (6.16) hold. This concludes the proof of part (ii). 

If £ 3> (A/IT 2 ) 2 / 3 then there exists an r\ such that the assumptions of part (ii) are met. Hence part (ii) 



and Proposition 7.1 yields part (hi). This concludes the proof of Theorem 2.11 



7. Complete delocalization of eigenvectors 



In this section we derive a delocalization result for the eigenvectors of H, using the complete delocalization 
A 2 -< (Nrj)^ 1 as input. We denote the eigenvalues of H by Ai ^ A 2 ^ ••• ^ Ajv, and the associated 
normalized eigenvectors by Ui, u 2 , . . . , u^r. We use the notation u a = (u a (x))^ =1 . We shall only consider 
eigenvectors associated with eigenvalues lying in the interval 

I := [-2 + «,2-«], 

where k > is fixed. 

For I = £(L) define the characteristic function P x j projecting onto the complement of the ^-neighbourhood 
of x, 

PxAv) : = l(\v-x\>£). 
Let e > and define the random subset of eigenvector indices through 



Ae,t ■= | a : A Q S I , y~]\u a (x)\ ||P x ,/u tt || sC e| , 



which indexes the set of eigenvectors localized on scale £ up to an error e; see Remark 7.2 below for more 
details on its interpretation. 

PROPOSITION 7.1 (Complete delocalization of eigenvectors). Suppose that A -< ^ for some admis- 
sible control parameter ^> . Let r\ = ?yjv be a sequence satisfying M~ 1+1 ^ r\ -C 1. Suppose that 

sup|G^(£ + i77)| 2 -< ^+6^, (7.1) 
Eel 

Let £ <C N. Then we have for any e > 

1-4 



N 



^ CVe + 0^{N- c ). 



Remark 7.2. The set A s ,t contains, in particular, all indices associated with eigenvectors that are exponen- 
tially localized in balls of radius 0(1). In fact, exactly as in [7j Corollary 3.4], Proposition 1 7. 1| implies that 
the fraction of eigenvectors subexponentially localized on scales £ vanishes with high probability for large N. 
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Proof of Proposition 17. 11 As usual, we omit the spectral parameter z = E+irj, where E e / is arbitrary 
and 77 is the parameter given in the statement of Proposition 7.1 By assumption on A, we have for all x 



Inn 



Imm 



(7.2) 



uniformly in E 6 I, where in the first step we used the spectral decomposition of G. Thus, for all x, the 
map y H> T^hn\G y x\ 2 is approximately a probability distribution on {1, . . . , N}. Roughly, (7.1) states that 
this probability distribution is supported on the order of N sites of {1, ... , N}. More precisely, (7.1 ) yields 
(introducing the standard basis vector S x defined by (S x )(y) := 5 xy ), for any fixed x, 



n 



Im m 



\Px,e G5 X \ 



<1 



Imm 



>£)\G yx f 



n 



Imi 



Imm 



^lQy-x\ <l)\G vx f 



= 1 + 0^) + 0^ 
= 1 + 0^(A- C ), 



V ( N 



Imm V Nrj 



n-c 



(7.3) 



uniformly in E € /. Here in the third step we used (7.2) and (7.1), and in the last step the upper bound 
77 ^ M~ c and the fact that ^ is admissible. 

In order to obtain a statement about the eigenvectors, we do a spectral decomposition G = , 
which yields for arbitrary ( > 



u Px,£ G 5 X II — 



n 



Imm 



Imi 



c 



Im m 



E 

aGA,. 



X a — z 



u a (x) P x ,lU a 



Imm 



(i + O 



E 

aeA° e t 



A Q — z 



u a {x) P x ,tU a 



(7.4) 



where we introduced the complement set A° e » '■— {1, . . . , N} \ A £ j. In order to estimate the first term on 
the right-hand side of (7.4), we write 



V 



Imm 



E 

aeA e , 



X a — z 



u a (x) P x ,lU a 



V 




Imm 






a 


V 




Imm 


I 






Im G xx 



E 



X a — z 



u a (x) u a 



Im m 



\Ug(x)\ 2 

\K-z\ 2 
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Therefore we may estimate the left-hand side by its square root to get the bound 

2 

r _ u a (x) P x ,£U a 



Imm 



aeA E 



< (l + O x (*)) i; 
u 1mm 

< (C + 0^(*)) J2 



V" T U a (x)P x 

' ' An, — Z 



{E-\ a f+7f 



\u a (x)\ \\P Xt iU 



(7.5) 



where in the last step we used (3.6). 



Similarly, we may estimate the second term of ( 7.4 1 using 



x) P x tVL c 



^2 , u a (x)u c 



E 



\u a (x)\ 2 



(7.6) 



Combining (7.4) with (7.3), (7.5), and (7.6), we get 



l + 0^(iV- c ) < (C + O^N- c ))(l + ±) J2 



a£A Cl e 



(E-\ a ) 2 + v 2 



\u a {x)\ ||P x ^u a || 

uniformly for E € I. Now taking the average |/| _1 Jj dE and using Jensen's inequality, we find 
1 + 0^(N- C ) < (c + 0^(^V- c ))fl + i) ^ \u a (x)\\\P x ,txi a \\ 

V \u a {x)\ 2 



aeA E 



<*eA° e 



Imm |A a — E — vrj\ 2 ' 



where we used that 



dE- 



I j (E - \ a ) 2 + V 2 
for a 6 A Sl £. Averaging over x, i.e. taking N^ 1 J2 x i yields 



1 + 0^(N- C ) < 



(C + 0.(7V- c ))(l + ^e+(l + C)^/d£;l £ 



Imm |A a — E — ir/| 2 



(7.7) 



(7.8) 



where we used the definition of >4 £ /. We may estimate the integral as 



1 1 J c.a= 



'/ 



1 



Imm |A Q — E — irj\ 2 \I\ J N Imm 



1 ImTrG „ , , 

dE— — = 1 + 0^) 
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Setting £ = y/e in (7.8) therefore yields 



1 1 J aeA c , 



Imm \X a — E — ir/| 



> 1- (C + 0^(N- c ))^-0^{N- c ) > 1~C^-0^{N- C ). 



Using 



I J n 4^ 



Imm |A Q — E 1 — i??| 2 |/| j Imm 



1 ImTrG ^ , . 

dE — — = l + O x (*) 



we therefore get 



1 1 J a£A e 



Imm |A Q — E — ir)\ 2 



< CVe + 0^(N- c ). 



The claim now follows from (7.7). 



□ 



8. Extension to higher dimensions and a slowly decaying band 



In this Section we extend Theorem |2.2[ Corollary |2.4| and Theorem |2.5| in two directions: higher dimensions 
d and a slowly decaying band. 

The multidimensional analogues of the slowly decaying profile are left to the reader, as is the formulation 
of these extensions if a small mean-field component is added to the band matrix. All these results can 
be obtained in a straightforward manner following the proofs for the one-dimensional case with a rapidly 
decaying /. 

8.1. Higher dimensions. Fix d = 1, 2, 3, ... , and recall that N — L d and M x W d . Throughout this section 
we continue to assume (2.17). The following lemma gives the sharp upper bound on the size of Q xy defined 
by the formula (2.24), where x and y take on values in the d-dimensional torus = T. 



Lemma 8.1. Let d = 2,3, . . . and assume (2.17|). Then there is a constant C such that 

e, 



'xy 



1 1 

M' Wq 



for all x and y. 

Proof. See Appendix [A} □ 
In order to state the precise form of the profile Q xyi we define the covariance matrix D = Dw through 

1 ^ ^ XiXf 



u * - 2 ^ W 2 

xGT 



SxO ■ 



(8.1) 



We have 



D 



o{w- r ) 



where 



(A»)i. 



XiXjf(x) dx . 
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Since Dqo > we get D ^ c > uniformly in W . 

Next, we define the d-dimensional Yukawa potential 



V(x) 



dq e iqx 
(2n) d 1 + q 2 



(x € R d ) , 



where the integral is to be understood as the Fourier transform of a tempered distribution. For d = 1 we have 



V(x) 



|e l a l and for d - 



3 we have V{x) 



of type \x\ 2 d at the origin, and in d = 2 dimensions a singularity of type log|cc|; see e.g. 
for more details. The leading-order behaviour of the profile O x q is given by 



Generally, in d ^ 3 dimensions V has a singularity 

Theorem 6.23] 



28 



N 



E 



x(w P ) 



III 



l {ar,) d / 



2-1 



m] + W 2 (p ■ Dp) 



W d VdctD 



E 



arj ) 



D- 1/2 (x + kL) 



(8.2) 

where x is a smooth function satisfying x(q) — 1 f° r \l\ ^ 1/2 and x(l) = f° r \q\ ^ 1, <p is a Schwartz 
function satisfying J <p = 1, and ^t(a;) := t~ d ip(x/t). (In fact, </?(I? _1 / 2 x) is the Fourier transform of x(<?).) 
The second step of ( |8.2[ ) follows by Poisson summation; see Appendix [A| and in particular (A. 14 1 for more 
details. The following lemma gives the precise error bounds in the approximation (8.2). 



Lemma 8.2. Let d= 1,2, 3,... and assume (2.17). Then 



>x-y 



o 



K 



— (-) 

w d \w I 



-K 



ifl 2 I y/fjx \ K 



Proof. See Appendix [XJ 



(8.3) 



□ 



Remark 8.3. The convolution in (8.2) smooths out the Yukawa potential on the scale x m W. The error 
terms in (8.3) are negligible compared to the main term 9 X in the regime W < |x| s$ CWir 1 ' 2 - Therefore 



the approximation 8 is meaningful from the profile scale Wr] 1 / 2 down to the band scale W. The actual 
choice of the function \ in (8.2) is immaterial in the relevant regime \x\ ^> W, as long as x is equal to one 



in a neighbourhood of the origin. 

Next, we state the counterparts of Theorem |2.2| Corollary |2.4| and Theorem |2.5| in the higher-dimensional 
setting. Their proofs are trivial modifications of the proofs of their one-dimensional counterparts, using 
Lemmas 18.11 and 18.21 



Theorem 8.4 (Improved LOCAL SEMICIRCLE law). Let d = 2, 3, . . . and assume (2.17). Suppose moreover 
that L < W 1+d l A and r] 3> L 2 /W d+2 . Then we have 



A -< max 



1 1 



(8.4) 



M ' _ 
for z G S. 

Corollary 8.5 (Eigenvector delocalization). Let d = 2, 3, ... and assume ( |2.17[ ). If L < w 1+d / A 
then the eigenvectors of H are completely delocalized in the sense of Proposition pO} 
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Theorem 8.6 (Diffusion profile). Let d = 2,3, ... and assume \2.17\ . Suppose that L < W 1+d / A and 
(W/L) 2 < j] < 1. Then 



\Txy ®xy I ^ 



i 



i 



Nr) M 3 / 2 t] 



Px\G xv \ -5 xy \m\ -\m\ <d x . 



-< 



Nr] M 3 / 2 t] 



(8.5) 



Moreover, the analogues of (2.31) and (2.32) hold with 

d/2-1 



1 / x-y 



where K is an arbitrary, fixed, positive integer. 



W d 



8.2. Slowly decaying band. In this section we make the following assumption on the band shape. Suppose 
that d = 1 and / is smooth and symmetric, and satisfies 



fix) = k{ ^- 

for some fixed f3 € (0, 2). Here h is a symmetric function satisfying 

\h(x)-h \ ^ C(x)~ 3 



(8.6) 



•7) 



for some fixed ho > 0. Note that by definition / is smooth and symmetric, so that h(x) = 0(|a;| 1+ ^) near 
the origin. 

In order to avoid technical issues arising from the periodicity of S, we cut off the tail of / at scales x « N. 
Thus we set 



s xv := \/Y' r '' /V 
xy Z J \ W 



[x - y\w 
N 



here a is a smooth, symmetric bump function satisfying a(x) — 1 for \x\ ^ a and o~(x) = for \x\ ^ b, where 
< a < b < 1/2. As usual, Z is a normalization constant. 

The following lemma is the analogue of Lemma |5.2| Its proof is similar to that of Lemma |5.2| the key 
input is Lemma A. 2 (iii). 



Lemma 8.7. Suppose that d = 1 and that (8.6) and (8.7) hold. Then 

Clog TV 



1 



1 - Imps' 



n 



<: 



Next, we give the sharp upper bound on the peak of the profile. 



Lemma 8.8. Suppose that d—1 and that (8.6) and (8.7) hold. Then 



Q xy ^ C max 



r, 1 ^- 1 + 1 1 
W 'Nrj 



Proof. See Appendix |A") 



□ 
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In order to describe the asymptotic shape of the profile, we define 



B 



h 



W 



du ■ 



1 — cos u 

|u| 1 +' 3 



ho / du ■ 



1 — cos u 



1 1+0 



O 



W 



which plays a role similar to the unrenormalized diffusion constant D from (2.26). Moreover, define the 
function 

dq e u 



V(x) 



2vr 1 + \q\P ' 

which is bounded for f3 > 1. It is easy to check that for f3 > 1 and \x\ ^ 1 we have 

V(x) = Cplxl- 1 -? + 0(\x\- 2 - fl ) 
with an explicitly computable constant Cp > 0. 



(8.9) 



Proposition 8.9. Suppose that d = 1 and that (8.6) and (8.7) hold for some (3 > 1. Suppose moreover that 

(8.10) 



A* 



< 7? < 1. 



TTien i/iere is a constant c > 0, depending on the implicit exponents in (8.10), such that for x £ T we have 



|m| 2 / a?7 
War/ V 



r 



IF 



W 



o 



w 1 



(8.11) 

□ 



Proof. See Appendix [X) 

The matrix O is the resolvent of a super diffusive operator, whose symbol in Fourier space is l?|Wp|' 3 . 
Thus, under the identification t = n -1 from Remark 2.7 we find that the associated dynamics scales according 
to x ~ Wt 1 ^ instead of the diffusive scaling x ~ Wt 1 / 2 . 

We may now state the counterparts of Theorem |2.2| Corollary |2.4[ and Theorem |2.5| for the slowly 
decaying band. Their proofs are trivial modifications of those for the strongly decaying band, using Lemmas 
IQandET 



Theorem 8.10 (Improved local semicircle law). Suppose that d = 1 and that (8.6) and (8.7) hold. 
Suppose moreover that N < W 1+1 / 2f} and 77 > {N/W f/W. Then we have 



A -4 



r) 1 /?- 1 + 1 J_ 
Nr] 



W 



112) 



/or z€ S. 



Corollary 8.11 (Eigenvector delocalization). Suppose that d = 1 and that ( |8.6[ ) and ( |8.7|) /told. // 

A" -C W 1+1 / 2 P then the eigenvectors of H are completely delocalized in the sense of Proposition 



7.1 



Theorem 8.12 (Diffusion profile). Suppose that d — 1 and that (8.6) and (8.7) hold for some j3 > 1. 
Suppose that N < M/ 1+1 /2/3 ana > (w/N)P < 77 < 1. TTien 



-^c I G xy I 



|m| 2 8. 



1 5. 

-*\ — — h 
Nr] 



xy_ 

W 



3.13) 



Moreover, the analogues of (2.31) and (2.32) ZioZd. 
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A. The deterministic profile 



In this appendix we establish bounds and asymptotics for the deterministic profile O xy . 



A.l. Proof of Proposition 



2.8 



We assume d = 1 and abbreviate T = T]y as well as P = Pjy. We assume 



without loss of generality that i€T. 

First we notice that by the symmetry of / and (2.17), the Fourier transform f{q) := L e~ lqx f(x) dx is a 
smooth function with a Taylor expansion 



/(<?) = 1 - D^q 2 + q 4 g(q) (q G 



(A.l) 



where g is a bounded smooth function satisfying g(q) = g(—q). Clearly, / is real and ||/||oo ^ 1- Moreover, 
we claim that for any e > there exists an e' > such that 



if 



M > e: 



indeed, this follows easily from the identity 

1 "/(<?) = / (l-coB(gaO)/(aOd*. 

Next, define the lattice 



For q e [— ttW, irW] define 
Thus we get 

S 



2nW 

Q := WP = — T. 



S w (q) := S(q/W). 



1 - \m\ 2 S 



jL pip 31 

N ^ l 



S(p) 



\m\ 2 S(p) N 



SW(g) 



96Q 



l-|m|2^( g ) 



for ieT. 

As a guide for intuition, we have Sw(q) ~ f(l), as can be seen from 



Sw(q) = 



^—ixq/W 



1 



■>N,W 



JV/2 



AT/2 



ac \ 1 



dx 



N/2W 



N/2W 



Thus, our proof consists in controlling the error in the approximation 



S 



1 — \m\ 2 SJ xo 



1 

TP 



Jqx/W _ 



/(<?) 



1 - \m\ 2 f(q) 



dq. 



(A.2) 



(A.3) 



e" i9 V(y) dy « /(?) . 



(A.4) 



As a first step, we establish basic properties of Sw that are analogous to (A.2) and (A.l). 

Lemma A.l. The function Sw smooth with uniformly bounded derivatives, real, and symmetric with \ Sw(q)\ ^ 
1 and SV(0) = 1. Moreover, it has the following properties. 
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(i) For any e > there exists an e' > such that 

Sw(q) <l-s' if \q\ > e 
for large enough W (depending on e). 
(ii) For any k,K £ N there is a constant C'k.K such that 

\d k q S w (q)\ < C ^+0(N- 1 ), (qe[-nW,nW}). 



(A.5) 



(A.6) 



(Hi) There exists a smooth function gyy whose derivatives are bounded uniformly in W such that 

S w (q) = l-Dq 2 +q 4 g w (q), {q € [-ttW, TtW]) . (A.7) 



PROOF. The proof of (i) is similar to that of (A. 2 1. 

To prove (ii), we use summation by parts combined with (2.17). Let the integers N_ and N + denote the 
end points of T = [AT/2, N/2), i.e. T = [JV_, N + ]. Then we hnd 



S w (q) 



n+ 

- E 



-iqx/W 



N+-1 



x - E E 



-iqy/W 



x=N_ y=N_ 



N+-1 

E 



-iqN_/W _ -iq(x+l)/W 



c 

~w 

c 



K \ ~> 1 

V 2-i |l — e -i 



1 _ e -iq/W 



1 / X 

i/ w \W\W 



-K 



c 



K 



x + l 
W 



1 



X + l 

W 



N+ 
W 



O 



K 



N+ 

e ~ iqv/w 

y=N_ 

N /W^ ls 

w I at 



K-l 



\q\ 



Here we used that 



(recall that \q\ < irW) and that 



CW 



|l_ c -i?/W| ^ \ q \ 



x + l 
W 



w \w/ 



(A.8) 



to estimate the main term, and (2.1) to estimate the error term with K ~ 1/(5. We also have the trivial 
bound on \Sw\ ^ 1- Thus we have 

\Sw(q)\ < ^ ) +0(N~ 1 ). 



38 



We can iterate the above argument for the main term in ( A. 8 1, thus obtaining higher order divided differences 
of /. Since / is smooth and decays rapidly, (A. 6 1 follows for k = 0. The proof for k > is analogous. 
In order to prove (hi), we write 



1- Dq 2 -S w (q) 
9w[Q) ■= ^ 



1 



E 



1 - {qx/Wf/2 - co%{qxjW) 

(qx/wy 



Now (hi) follows from the fact that the function h(q) := (l — q 2 /2 — cos(g))g 4 is smooth and its derivatives 
are bounded. □ 

Having proved Lemma |A.1[ we may now complete the proof of Proposition |2.8| Fix a small constant 
e > and introduce a partition of unity x + x = 1 on M with smooth functions such that = 1 for |g| ^ e 
and x{q) = for \q\ > 2e. Since Sw{q) ^ 1 and \m\ 2 < 1 — cry (see (3.6)), we have 

l-\m\ 2 S w {q) > l-|m| 2 > cry 



with some positive constant c. By (A. 5) we have on the support of x 

1- \m\ 2 S w (q) ^ e' 



(A.9) 



for some s' depending on e. Then from (A.3) we have 

Sw(q)x(q) 



S 

1 - \m\ 2 S 



N 1 - \m\ 2 S w (q) N ^ 



qeQ 



qeQ 



Sw(q)x(q) 

1 - \m\ 2 S w (q) 



N ^ i 

qeQ 



Here we used that the function 



R(q) 



Sw(q)x(q) 

- \m\ 2 S w (q) 

Sw(q)x(q) 

1 - \m\ 2 S w {q) 



Ok 



1 / x 
W\W 



-K 



(A.10) 



extended to the whole real line, is smooth and its derivatives are bounded uniformly in N and W (by 
(A.9)). (These bounds may of course depend on e). Moreover, R(q) = Ok({q)~ ) for any K; see (A. 6). By 
summation by parts, as in (A.8), we find that for such a function we have 



1 £ c^ w R(q) = K 



qeQ 



1 / x 
W\W 



-K 



(ATI) 



for any K . 

Now we consider the first term in (A. 10). For the following we use Ai(q,r),N,W) with i = 1,2,3,... to 
denote functions that are smooth in q and whose g-derivatives are uniformly bounded in q, ry, W, and N. 
Using the Taylor expansion (A. 7) and (3.5), we have (omitting the arguments for brevity) 

l-\m\ 2 S w = ar] + Dq 2 + A iq 4 + A 2 r]q 2 + A 3 r] 2 . 
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This gives (again omitting the arguments) 
Sw 1 



1 - \m\ 2 S w ar l + D( i 2 



X 



A 4 q* + A 5m 2 + A 6 r] 2 



(at) + Dq 2 + A x q 4 + A 2 r\q 2 + A 3 r] 2 ) [arj + Dq 2 ) 
A 4 r 4 + A 5 r 2 + A 6 



(a + Dr 2 + Ai-qr 4 + A 2 r]r 2 + A 3 rj) (a + Dr 2 ) 
Fn,w,t] ( r ) > 



X 



where we introduced the new variable r '■= 



V 



(A.12) 

l ' 2 q. By definition, A\,...,A§ and their g-derivatives are 
-1/2 



uniformly bounded. Since D ^ c > and r ^ erf > on the support of x, we find that for small enough s 
the denominator of the second line of (A.12) is bounded away from zero, uniformly in r, r\, W, and N. We 
therefore conclude that i^jv.W,^ is smooth and its derivatives (in the variable r) are uniformly bounded. 
Using summation by parts, exactly as in (A.8), we get 



-K 



(A.13) 



Here we used that the sum on the left-hand side ranges over a set of size 0(N/W) due to the factor \ m the 
definition of -Fjv.w.iy- Therefore \A.12 ) and (A.13) imply that the first term of (A.10) is given by 



iqx/W 



X(q) 



arj + Dq 2 



Ok 



JL/ 

w\ w 



-K 



(A.14) 



Notice that the error term in ( A.ll[ ) is smaller than in (A.14). Next, we remove the factor \ from the main 
term, exactly as in (A. 11). Plugging this into (A. 10) yields 



S 



1 - \m\ 2 S 



N 



qx/W _ 



1 



qeQ 



arj + Dq 2 



Ok 







1 




w 1 





(A.15) 



We can extend the summation in the main term 

1 



N ^ 



arj + Dq 2 



- E 



}qxjW 



1 



arj + Dq 2 



O 



1 f l(\q\ > irW) 



where the error term on the right-hand side is of order 0(W~ 2 ). Thus we have 



W J R arj + Dq 2 



dq 



( 3 ) = 1 y 

VI- |m| 2 5/a:0 N 4-?.. 



iqx/W 



1 



arj + Dq 2 



O 



K 



J_ / vV 
w\ w 



-K 



O 



1 

w 2 



The main term can be computed by the Poisson summation formula 



1 



E 



a z _|_ n z 



_ 1 V^ e -a|a:+27rfe| 
a — * 
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where a > 0. Thus, 



- E 



jqx/W _ 



<ie = 



ar) + Dq 2 2W^/Da~q 



—y 



exp 



feez 



|a; + /c7V| 



This concludes the proof of (2.37). 



In order to prove (2.381, it suffices to analyse the asymptotics of the expression 

R 



1 Ve-v^wfc 

We consider two cases. If r\ ^ (^f) 2 then R X w*/^ - ^ n tnc otncr hand, if 77 ^ (-^r) 2 we use an integral 
approximation to get 



R = — ^-Ve-^wfc ~ J 



Nrj 



This concludes the proof of (2.38), and hence of Proposition 2. 



A. 2. Higher dimensions: proofs of Lemmas 



8.1 



and 



8.2 



Proof of Lemma 18.11 We follow the argument from the proof of Proposition |2.8| in the previous section, 
and merely sketch the differences. We use the d-dimensional lattices 



Q 



2nW 



L 



T. 



Exactly as in (A. 3 1, we get 
S 



1 - Imps' 



x0 q(EQ 



_L V P ic i- x / w — 
1 



S w (q) 



\m\ 2 S w (q) 



where S w (q) := ^ e_i<? ' 



x/W 



SxO 



(A.16) 



xGT 



Next, the basic properties of Sw listed in Lemma A.l and their proofs, carry over verbatim to the higher- 
dimensional setting. Now (A. 7) reads Sw{o) = 1 — (<7 • Dq)(l + A 2 (q)) whe re A 2 (g) = 0(|q| 2 ) uniformly in 
W. Let x be a smooth bump function on M. d , as in the proof of Proposition 2.8 As in (A. 10), we find 

-K< 



s 



1 - \m\ 2 S 



1 E 
N ^ 



j q -x/w s w{q)x(q) 



O 



1- \m\ 2 S w (q) ' ~ K \W d \W I 



for arbitrary K G N. Here, and for the rest of this proof, the summation in q ranges over the lattice (^^Z) d . 
We split 



S 



1- \m\ 2 S 
where 



.i-O 



N ^ 



x/W 



x(q) 



ar] + q ■ Dq N 



-K 



, (A.17) 



R{q) 



S w {q) 



l-\m\ 2 S w (q) ari + q-Dq 
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Note that, unlike in the proof of Proposition |2.8| we keep the cutoff function \ hi the main term since the 
function (j] + q ■ is not integrable in higher dimensions. 

The main term of (A. 17 1 can be computed using Poisson summation: 



1 

N 



E 



^iq-x/W _ 



{ari) d ' 2 



mj + qDq W d VdetD 



12— d 



x\" " near the origin, we find \\V * tp 

\ 2 



J2( v *^)(^ D ~ x/2 ( x -y+ kL n • (A.i8) 

^ CW~ d . By treating the two cases 



Using that V(x 

7] ^ ( ^) 2 a nd r\ ^ (^r)^ separately, we find exactly as in the last paragraph of the proof of Proposition 
that pLl8t is bounded by CW~ d + C(Nri)- 1 



2.8 



What remains therefore is the estimate of the error term containing R in (|A.17|. To that end, we write 
R{q) 



B A + r]B 2 + r] 2 B 



(arj + (q ■ Dq)(l + A 2 ) + r,A 2 + r] 2 A ) (arj + q ■ Dq) ' 



(A.19) 



where B 0} B 2 , B Al A , A 2 , A' 2 are smooth and bounded functions of q, each of order 0(|g| J ) near the origin 
uniformly in W and 77, where i denotes the subscript of the corresponding function. Using the change of 
variables q — iJrjr it is now easy to see that the error term containing R in (A. 17 1 is bounded by CW~ d . 
This concludes the proof of Lemma 8.1 □ 



Proof of Lemma 18.21 We need a more precise bound on the error term of (A. 17) than the bound CW d 
from the proof of Lemma 8.1 In fact, we claim that 



N 



-K 



rfl 2 
W 1 



W 



(A.20) 



The proof of (A.20) is a rather laborious exercise in Taylor expansion whose details we omit. The basic 
strategy is similar to the analysis of (A. 12), except that we expand Sw up to order d/2 + 2 (instead of 4). 
This completes the proof of Lemma 8.2 □ 



A. 3. Slowly decaying band: proof of Lemma 8.8 and Proposition 



8.9 



We begin by proving the following 



auxiliary result, which gives the relevant asymptotics of Sw For q ^= define 



Kq) 



w 



du 



1 



— cos u ( Wu 



B + O 



W \ 
qN] 



(A.21) 



We also set 6(0) := 0, so that b is continuous. 



Lemma A. 2. Suppose that d = 1 and that (8.6) and (8.7) hold. Then the following are true, 
(i) For any K € N there exists a constant Ck such that for \q\ J? 1 we have 

\S w {q)\ < 



(ii) For \q\ ^ 1 we have 

uniformly in N and W . 



Sw{q) = 1- b{q)\qf + 0(q 2 ) 



(A.22) 
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(in) There is a constant c\ such that 



\SU\\ < 1-ci 



wV 

N 



Proof. Part (i) is proved similarly to (A. 6 1, using summation by parts. 
In order to prove part (ii), we write 



i-s W ( q ) 4e ( i - cos ^))^(^) 



(A.23) 



Let x be a smooth, symmetric bump function satisfying x( x ) — 1 f° r \ x \ ^ 1 arL d xi x ) — f° r \ x \ ^ 2. Write 
X '■= 1 — X- We introduce the splitting 

h = xh + x(h ~ M + h X 



on the right-hand side of (A.23). It is easy to check that the two first terms give a contribution of order 
0(q 2 ). The last term of the splitting gives rise to 



ho 



/, , ,x x{x) (Wx\ , W f. 



1 - cos (^))^w t 



dx + O 



q 

w 2 



l 

A2 



(A.24) 

where the last step follows from a mid-point Riemann sum approximation. Now a change of variables u = qx 
easily yields (|A.22|. 



Part (iii) follows from part (ii) using an argument similar to (5.15). 
PROOF of Lemma 18.81 The claim follows from the bound 



□ 



e, 



c w v-^ 



Sw(q) 



C C 



d s w ( q )i(\ q \^^/0) < 

9 l-|m| 2 5w(g) " 



C(r? 



1//J-1 



1) 



TP 



W N ttq 1 ~ H 2 ^) ^ ^ 

where the first term is the contribution of the low modes \q\ ^ r/ 1 ^ and the second term the con tribu tion 
of the high modes \q\ ^ n 1 ^ , which may be replaced with an integral and estimated using Lemma 
omit the details. 



A.2 



We 

□ 

Proof of Proposition 18.91 We proceed similarly to the proof o f Pro position |2.8[ We choose a cutoff scale 
e, and denote by x the bump function from the proof of Lemma A.2 The scale e satisfies vr 1 ^ « £ C 1, 
and will be chosen by optimizing at the end of the proof. 



We use the expansions (|A.22| and (|3.5|). Thus we find, as in the proof of Proposition 2.8 

S — ) 
,i2c ; 

xO 



1 - ImPS- 



1 y e i?* 

N ^ 



/w _ 



x(g/g) 

m 1 + B\q\f i ' N 



1 Y j e^' w R{q)x{q/e) + 



where x is a smooth bump function as in the proof of Proposition |2.8| and 



w 



(A.25) 



R(q) : = 



S w (q) 



1 



{B-b)\q\P + 0(7?+^ +q 2 



l-\m\ 2 S w {q) a V + B\q\? (a V + b\q\P + 0(r] 2 + n\q\P + q 2 )) (a V + B\q\f>) 
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Note that for q € Q \ {0} we have b(q) ^ c. Using (A. 21 ) we may therefore estimate, as in (A. 12 1, to get 



lj2\R(i)\x(q/ 



e) € 



C_ 
W 



logA/y^- 2 



+ l + £ 2-V/^-l 



Next, the bump function in the main term of (A. 25 ) may be easily removed, and the summation in q extended 

N 

,1/0-1 



to the whole lattice ^^Z, at the expense of an error of order 0(e 1 13 /W). Putting everything together, we 
get 



.5' 



1 - IrnPS 1 



= J_ C iqx/W 1 

,o N J^ W7 ar, + B\q\V 



+ 



w 



-0(W~ 



for some c > 0, where we used (8.10). Setting e := n 1 1 /' 3 and Poisson summation yields 

l™|2 /_\ 1//8 



Wan V B 



fcez 



07/ 
if 



1//3 



x + kN 
W 



o 



1/(8-1 



Now (8.11) follows by noting that by (8.9), under the assumption (8.10), only the term k = is of leading 
order. □ 



B. Multilinear large deviation estimates 

In this appendix we give a generalization of the large deviation estimate of Corollary B.3 [231. The proof is 



simpler and the statement is formulated under the assumption (2.11 ) instead of the stronger subexponcntial 
decay assumption. Moreover, since the current proof does not rely on the Burkholder inequality, it is trivially 
generalizable to arbitrary multilinear estimates. 

Throughout the following we consider random variables X satisfying 

EX — , E\X\ 2 = 1, \\X\\ p < fj, p (B.l) 
for all p with some /i p . Here we set := (ElA^) 1 ^. 

Theorem B.l (Large deviation BOUNDS). Let (X^), (Y^), (a\f^), and (b\ ) be independent fam- 
ilies of random variables, where N € N and i,j — 1,...,N. Suppose that all entries and 1^ are 
independent and satisfy ( |B.1| . 

(i) Suppose that (E;N 2 ) 1/2 < *• Then E t hXi < 

1 /2 

(ii) Suppose that (X^l a y| 2 ) ~< Then Y,,, s a.j.X.X , -< 
(Hi) Suppose that (Y,iJ°>ij\ 2 ) ^ *■ Then <',,-V,V, ~< *■ 

If all of the above random variables depend on an index u and the hypotheses of (i) - (Hi) are uniform in u, 
then so are the conclusions. 
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The rest of this appendix is devoted to the proof of Theorem |B.1| Our proof in fact generalizes trivially 
to arbitrary multilinear estimates for quantities of the form Y]^ i ai 1 _.i k (u)Xi 1 (u) ■ ■ ■ Xi k (u), where the 
star indicates that the summation indices are constrained to be distinct. 

We first recall the following version of the Marcinkiewicz-Zygmund inequality. 



Lemma B.2. Let X\, . . . , Xn be a family of independent random variables each satisfying (B.l I and suppose 
that the family (hi) is deterministic. Then 

1/2 



E^Q < (Cp) 1 ' 2 ^ [Y)b- 



(B.2) 



PROOF. The proof is a simple application of Jensen's inequality. Writing B 2 := X)jI^| 2 j we get, by the 
classical Marcinkiewicz-Zygmund inequality 38 in the first line, that 



< (Cpfl 2 

p 

= (Cp) p/2 B P E 

< (Cp) p/2 B P E 

< (Cp) p ' 2 B p p p . 

Next, we prove the following intermediate result. 



E 



B 2 



1/2 



p/2 



B 2 



□ 



Lemma B.3. Let X\, . . . , Xjy, Yi, . . . , Yn be independent random variables each satisfying (B.l I , and suppose 
that the family (a^ ) is deterministic. Then for all p ^ 2 we have 



E"<.<- V < v < < °pf4 El 

Q>ij | I 



1/2 



Proof. Write 

E a>ijXjYj = E b i Y 3 > := E "•'• V ' • 

Note that (bj) and (YA are independent families. By conditioning on the family (bj), we therefore get from 
Lemma |B.2| and the triangle inequality that 



v o- p/2 V „• 1 



Using Lemma B.2 again, we have 



1/2 



This concludes the proof. 



□ 
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Lemma B.4. Let X\, . . . , Xjy be independent random variables each satisfying (B.l), and suppose that the 
family (ay) is deterministic. Then we have 



^a lj X l X j sC Cp^A^ 



1/2 



Proof. The proof relies on the identity (valid for i ^ j) 

i = E i(< e e J) , 



(B.3) 



where the sum ranges over all partitions of Njy = {1, • ■ ■ , N} into two sets I and J, and Zm :— 2 N 2 is 
independent of i and j. Moreover, we have 



E 1 = 2*-2, 



(B.4) 



1/2 



/UJ=N N 

where the sum ranges over nonempty subsets / and J. Now we may estimate 

E"« A '< A .< < y~ ^ EE a « x *^ < ^ E q^(em 2 ) 

i^j P N IU.J=N N i£l jeJ P N IU.J=N N H^j ' 

where we used that, for any partition I U J — Nn, the families (X,-),- e j and (Xj)j e j are independent, and 
hence the Lemma B.3 is applicable. The claim now follows from (B.4). 



□ 

As remarked above, the proof of Lemma |B.4| may be easily extended to multilinear expressions of the 
form Y"\. a,, ,-, Xj. ■ ■ ■ Xj, . 

We may now complete the proof of Theorem |B.1| 



Proof of Theorem IB. 11 The proof is a simple application of Chebyshev's inequality. Part (i) follows 
from Lemma B.2 part (ii) from Lemma B.4 and part (hi) from Lemma B.3 We give the details for part 
(hi). 



For e > and D > we have 

^aijXiXj \f X * 

i¥=3 



(E 

1/2 



1/2 



Ei 



<: 



E«^^ ^ e/2 Ei 



1/2' 



■ AT - 



-D-l 



iV 



1 /2 

for arbitrary £>. In the second step we used the definition of (X^j l a y | 2 ) ^ with parameters e/2 and 
D + 1. In the last step we used Lemma B.4 by conditioning on (a !3 ). Given e and D, there is a large enough 
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p such that the first term on the last line is bounded by A~ D_1 . Since e and D were arbitrary, the proof is 
complete. 

The claimed uniformity in u in the case that ay and Xi depend on an index u also follows from the above 
estimate. □ 
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